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Prevention is as good as acure 


Priorities for the US Cancer Moonshot Initiative face an uncertain funding future — but it must not 
ignore proven prevention programmes in favour of glitzy research. 


President Donald Trump will and wont do, one thing is clear: 

some Republican lawmakers feel no compulsion to govern 
until he takes office in January. This could leave funding for some of 
President Barack Obama’ scientific mega-initiatives — including the 
Precision Medicine Initiative and the Cancer Moonshot Initiative — in 
limbo until next year. At present, there is a push in Congress to delay 
any decisions on the budget and pass a continuing resolution that will 
hold funding levels steady until March. 

The Cancer Moonshot Initiative, one of the newest schemes on the 
block, had been scrambling to get its goals in order ahead of the budget 
decision in time to impress lawmakers with its worthiness. But it could 
face more months of uncertainty — a harsh outcome for an initiative 
that had already committed to an aggressive timeline of doubling the 
pace of cancer research in five years. 

In recent months, the moonshot’s architects have laid out an 
ambitious menu of projects, and prioritizing those was always going 
to be a challenge. Given the absence of a clear fiscal future, deciding 
which projects take priority has become even more crucial. 

How to choose? It may be tempting to select and focus on the 
most superficially impressive and eye-catching science. But one of 
the moonshot' greatest strengths has been its willingness to take on 
not only cutting-edge research, but also the more mundane hurdles 
that can block the translation of findings to the clinic. Those low-tech 
projects should continue to receive emphasis, even as project leaders 
grapple with financial reality. 

When the moonshot's advisory panel of cancer experts released its 
ten scientific recommendations in September, for example, it high- 
lighted the expanded use of proven cancer-prevention techniques. 
In doing so, it drew welcome attention to these techniques and how 
improving their deployment could reduce deaths — from cervical 
cancer by 90%, from colorectal cancer by as much as 70% and from 
lung cancer by up to 95%. 

The tools are at our fingertips, if only society could muster the 
means to use them. Take smoking — it is still on the rise worldwide, 
despite its clear link to cancer. In 2015, about 30% of US cancer deaths 
were from lung cancer. By 2030, smoking is projected to kill 8 million 
people globally each year. Researchers need to learn more about the 
barriers that have held back efforts to reduce these numbers, and how 
best to implement strategies for stopping smoking. 

Similarly, colorectal cancer could be reduced through wider use of 
recent advances in non-invasive screening methods. But not many 
physicians are aware of these techniques, and too few people who are 
at risk of the disease are targeted for early screening. 

Many cancers caused by human papillomavirus (HPV) are now 
preventable with a vaccine given to adolescents. However, uptake of 
the vaccine still falls below targets, in part because of parental concerns 
and misconceptions about the risks of vaccination. 


A s the United States and the wider world tries to work out what 


Following the September report, the Moonshot Task Force — a 
separate group looking at how government agencies can work together 
to accelerate cancer research — released its own recommendations. 
These, too, included unglamorous but vital initiatives, such as a push 
for stop-smoking strategies in people who receive Medicaid, a US 
health-care plan for those with limited resources. And it announced 

an offer from some car-sharing companies 


“When budgets to give discounts to cancer patients travel- 
are constrained, ling to and from doctor's appointments. This 
itis important helps to tackle one of many logistical hurdles 
not to let the that contribute to low public participation in 
glitzy outshine clinical trials. 


The more high-tech recommendations 
from the Moonshot Initiative are also worthy 
projects. They include large-scale genetic analysis of tumours to push 
forward the fledgling field of personalized medicine, and clinical- 
trial networks to better harness the information gleaned from trials 
of immunotherapies and treatments for paediatric cancers. 

When budgets are constrained, it is important not to let the glitzy 
outshine the familiar. Too often in health research, straightforward 
prevention programmes, and the social science needed to implement 
them correctly, are pushed aside in favour of the basic research that 
fills this journal. A balance must be struck: the Cancer Moonshot and 
other initiatives should continue to emphasize the uptake of proven 
prevention strategies, even if the next administration tightens funds 
further than expected. = 


the familiar.” 


Daunting data 


The power of big data must be harnessed for 
medical progress. But how? 


volume with all the stars in the firmament. And in the seductive 
potential of its exponential, uncontrolled, ungraspable growth 
to improve out lives: by allowing medical treatments to be developed 
and approved more quickly — and, ultimately, truly personal medicine. 
But at a workshop held in London by the European Medicines 
Agency earlier this month, just how much science has to happen to 
make this beautiful future a reality was apparent to all. Patient groups 
and research scientists attended, alongside computational heavyweights 
from IBM Watson Health and Google Cloud Platform. Together, they 
tackled chewy questions to which there are few answers. 
How many data are ‘enough’ to reliably predict clinical effect? Which 
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data sets can be useful? How can they be managed? What's the best 
way to win the confidence of public and regulators? And, crucially, is 
academia training enough mathematicians and medical-data scientists, 
who will have to develop and harness all this new potential? The last of 
these questions at least has a clear answer: no. 

Big data sets in medicine include genomics, transcriptomics and 
proteomics (which, respectively, describe our genomes, identify which 
of our genes are being expressed and catalogue all of the proteins in a 
tissue sample). Genomic data sets alone have already shown their value. 
The presence or absence of a particular gene variant can put people 
in high- or low-risk groups for various diseases and identify in some 
cases which people with cancer are likely to respond to certain drugs. 

Buta single molecular data set will not contain sufficient information 
to tell the whole story of an individual's medical fate. Integration of 
different types of molecular data might tell more, but that remains a 
computational challenge. Even more would emerge if an individual’s 
molecular data were placed in the context of their physiology, behav- 
iour and health. Electronic health records, the numbers of which are 
skyrocketing, could be useful here. So could disease registries, hospi- 
tal and health-insurance records, as well as research publications and 
clinical-trial data. New types of data set come from wearables and apps, 
which collect health data directly from individuals, and from genome 
sequences of volunteers. Some researchers are even trying to extract 
medically relevant data from social-media platforms such as Twitter. 

This adds up to a mind-boggling volume of data. According to one 
estimate at the workshop, clinical data from a single individual will gen- 
erate 0.4 terabytes of information per lifetime, genomics data around 
6 terabytes and additional data, 1,100 terabytes. By 2020, the amount of 
health-related data gathered in total will double every 73 days. Health 
professionals will confront more data than do those in finance. 

To collect and hold all of this information within strict privacy 
regulations — non-negotiable for medical data — is another challenge. 


Some tech firms, such as IBM Watson Health and Hewlett-Packard, 
are building systems that keep data local — algorithms can dip 
into them, but the data are not transmitted anywhere else. Google, 
unsurprisingly, thinks that all these data are safer on the Cloud. 

The big question for scientists is how to take the next step to convert 
these artistic sketches of potential into scientific knowledge. Data sets 
vary in their reliability. Those derived from skimming social media 
will be very messy and will have to prove their usefulness. And large 
data sets, reliable or not, inevitably throw up spurious correlations, 

and so recognizing meaningful patterns 


“Health requires deep understanding of biology, 
professionals something that software developers don't 
will confront generally have. Future biologists — funders 
more data than and universities should note — will need 
do those in much more training in mathematics and 
finance. id data science. 


Big data has the exciting potential to allow 
clinical trials to be conducted partly in silico — which would mean 
using fewer animals in drug testing as well as recruiting fewer patients 
to the actual trials. Yet the field is developing at a time when public trust 
in experts is at an all-time low. 

Regulators are ready to be persuaded to accept computational 
information in clinical trials. On occasion, the European Medicines 
Agency and the US Food and Drug Administration already accept it 
in pharmacokinetics, one of the simplest data sets that drug develop- 
ers must supply agencies with. But the trust of doctors, patients and 
regulators in the abstract informatics and mathematical know-how that 
go into developing scientific and clinical predictions cannot be taken 
for granted. As medics and researchers nudge the field gradually from 
poetry and seduction to delivery, they must engage the next generation 
of scientists — and the public — and inform and educate them in the 
art and science of what is possible. m 


Source of hope 


More funding for the Middle East X-ray factory 
could give it a cafe — and sow the seeds of peace. 


quashed an uprising in Czechoslovakia, there was a place where 
citizens from the two cold-war superpowers could meet: the 
CERN cafeteria. To this day, the same spot at Europe’ particle-physics 
laboratory near Geneva, Switzerland, is a rare venue for Pakistani and 
Indian physicists to have a coffee, or for engineers from Israel and Iran to 
share ideas — no doubt fuelled by the huge lunches and inspiring views. 
There is no better cure for cross-border acrimony than meeting 
someone from the other country and realizing that they are not so 
different from yourself. And major science facilities, whose inhabit- 
ants already share many beliefs, such as in openness and the scientific 
method, are natural places to kindle such mutual understanding. 
What a chance, then, for SESAME. Synchrotron-light for Experi- 
mental Science and Applications in the Middle East is a venture that 
brings together Bahrain, Cyprus, Egypt, Iran, Israel, Jordan, Pakistan, 
the Palestinian Authority and Turkey, to build the region’s first 
synchrotron-light facility near Amman in Jordan. Proposed in 1997, 
modelled on CERN and created under the auspices of the United 
Nations Educational, Scientific and Cultural Organization (UNESCO) 
in 2002, the remarkable collaboration is now set to open (see page 475). 
SESAME has struggled at every turn. Several member states have 
made only partial payments, and its opening has been delayed by more 
than five years owing to financial difficulties. Other hurdles were less 
predictable: 2009 and 2010 saw Iranian council members assassinated, 
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and in 2013, the facility’s roof fell in after unprecedented snowfall. 

But SESAME has also ridden on a wave of generosity and persever- 
ance. Its director and various advisers contribute their time for free. 
Facilities in Germany, France, the United Kingdom, Sweden, Spain 
and Switzerland have donated crucial parts of the machine, and Jordan, 
which provided land for the site at no cost, has continued to cough 
up funding, even as the neighbouring Syrian war has brought huge 
numbers of refugees across its borders. 

To get this far is a terrific success. But when it opens next year, the 
facility will have just two beamlines, rather than a planned four, and will 
be without an administration building, dormitory, major library, audito- 
rium or visitor centre. Nor, sadly, will SESAME have its hub of scientific 
life, a cafeteria. Experimental equipment takes priority, but in a facility 
where time is so precious that researchers allotted 24 hours of beam time 
barely sleep, some worry that, with nowhere to grab a coffee and a seat, 
scientists from conflicting countries may pass like ships in the night. 

A group from the Sharing Knowledge Foundation in Geneva is try- 
ing to raise €30,000 (US$32,000) through public donations by the end 
of this year towards creating a SESAME cafeteria. But funds for both 
the cafe and the science could also come from another source, which 
has been conspicuously lacking over the years: the United States. 

A Congress ruling in 2011 cut funding to UNESCO, after the 
organization formally recognized Palestine as a member state. This is 
a public-relations difficulty for SESAME, even though there should be 
no official block because it is independent from UNESCO. A bigger 
question could be whether the incoming US president meddles with 
the Iran nuclear deal, which could restore sanctions. 

Still, brokering peace in the Middle East is something that Donald 
Trump said on the campaign trail he'd “really like to do”. For less than 
the cost ofa return flight to Tel Aviv on Air Force One, his adminis- 
tration could help SESAME to develop from a bare-bones scientific 
laboratory into a true beacon of peace and cooperation. m 
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environmental regulation in the United States, Europe is 
poised to take a significant and possibly decisive step on how 
to regulate endocrine-disrupting chemicals (EDCs). These chemicals 
are everywhere — in food, personal-care products, electronics and 
furniture — and are widely detected in human blood and urine at levels 
known to affect health. Yet action on them lags behind controls on haz- 
ards such as carcinogens. Early next month, European Union member 
states will take an important step when they review criteria proposed 
by the European Commission for identifying and regulating EDCs. 
Many pesticides contaminate foods and disrupt hormonal functions 
that are critical for brain development. Once EDC criteria are formal- 
ized, removal of these pesticides could help to prevent autism and loss 
of cognition, which have been linked to exposures in utero. The EDC 
criteria will also set scientific precedents for other 
national and global chemical policies. 
The state of the science and policy discussions 


A s the election of Donald Trump promises a bonfire of 


on EDCs are similar to those around climate THE 


EUROPEAN 


PARLIAMENT 
SHOULD PROTECT 


HUMAN 
HEALTH. 


change a decade ago. Research has suggested a 
more than 99% probability that these chemicals 
contribute to disease and disability. International 
bodies, including the World Health Organiza- 
tion (WHO), the United Nations Environment 
Programme and the International Council on 
Chemical Management, list them as an emerg- 
ing public-health concern. The effects of EDCs 
cross the entire lifespan, with disease burden and 
costs of US$217 billion annually in Europe and 
$340 billion in the United States (T. M. Attina 
et al. Lancet Diabetes Endocrinol. http://doi.org/ 
bs55; 2016). Even the reinsurance industry has advised its clients to 
reduce financial exposure related to the manufacture and use of EDCs. 

In response to this evidence, lobbyists and environmentalists have 
traded rhetoric and warnings. And as with the debate on climate change, 
a small group of scientists — many with well-documented links to 
industry — have endeavoured to manufacture a level of doubt that is 
out of proportion to the level of scientific disagreement. 

The scientists who deny endocrine disruption and dismiss the expert 
reviews on EDCs make many scientific inaccuracies and misrepre- 
sentations. Critics dismiss low-dose, nonlinear and non-monotonic 
exposure-response relationships for EDCs, even though they are well 
documented. They select studies with contaminated controls and other 
methodological problems to claim limited effects. They have argued that 
many studies of EDCs are based on correlation, not causation. 

Endocrine-mediated adverse outcomes are complex. Too often lost in 
the debate, however, is that findings about the impacts of EDCs carefully 
control for confounding factors. Results in humans are consistent with 
those from the laboratory, strengthening the evidence for causation. 

What should be discussed when the criteria are reviewed next month? 


Stand firm on 
hormone disruptors 


Ahead of akey meeting on endocrine-disrupting chemicals, 
Leonardo Trasande argues that policy must follow the science. 


The WHO defines EDCs as “exogenous compounds or mixtures that 
alter function(s) of the endocrine system and consequently cause 
adverse effects in an intact organism, or its progeny, or (sub)populations”. 
With a seemingly innocuous edit, the European Commission's draft 
criteria change the word “consequently” to “are known to” — placing 
too heavy a burden of proof for a chemical to be classified as an EDC. 

Unlike carcinogens, mutagens and reproductive toxicants, which 
can be identified from animal studies under EU law, the draft EDC 
criteria require human data on health effects. Given that these can 
arise years (if not decades) after exposures in early life, an entire 
generation could suffer the health consequences ofa regulatory delay. 
Animal and laboratory studies should be admissible. Arguably, this is 
the most important change needed. 

Another major error by the commission is to misrepresent the 
crucial distinction between hazard and risk 
under EU law. The distinction means that con- 
siderations of potency — the traditional expo- 
sure-response effect — should not enter into the 
decision on whether, say, a pesticide exposure 
presents a hazard. This important principle is cru- 
cial to underwriting the regulation of endocrine 
disruptors, and was sealed as scientific consen- 
sus at an April meeting in Berlin. This meeting 
united the vocal minority of scientists with the 
leading authors of a scientific statement by the 
Endocrine Society (A. C. Gore et al. Endocr. Rev. 
http://doi.org/bs69; 2015). Participants agreed 
that the WHO definition was adequate, and that 
potency is not relevant to the identification of 
hazards such as EDCs. 

The European Parliament should use science-based criteria to 
protect human health. EDC criteria should acknowledge the evolv- 
ing weight of evidence of a chemical’s disruption of hormones and its 
contribution to adverse outcomes. This approach would allow chemi- 
cals to be designated as EDCs; suspected EDCs; endocrine-active 
substances; and endocrine-inactive substances. Like the approach used 
to re-evaluate potential carcinogens and reproductive toxicants, the 
designation can be reconsidered as new evidence emerges. 

Some authors of the Berlin consensus statement continue to argue 
against the need for public-health protections, and have resorted 
to personal attacks, labelling peer-reviewed academic research as 
pseudoscience. In contrast to the US media, which has been criticized 
for its difficulty in discerning fact from fiction in the election cam- 
paign, we must stand firmly to defend scientific norms. The alternative 
is that public mistrust makes anti-scientific alternatives acceptable. m 


Leonardo Trasande is associate professor of paediatrics at New York 
University School of Medicine, USA. 
e-mail: leonardo. trasande@nyumc.org 
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Homing in ona 
fast radio burst 


The origins of powerful, 
millisecond-long radio 
pulses from space called fast 
radio bursts (FRBs) remain 
a mystery. But researchers 
studying the brightest FRB 
seen so far have zeroed in on 
its location more accurately 
than ever before. 

Vikram Ravi at the 
California Institute of 
Technology in Pasadena, 
Ryan Shannon at CSIRO 
Astronomy and Space Science 
in Marsfield, Australia, and 
their team observed an FRB 
in real time using the Parkes 
Observatory radio telescope 
in Australia. By modelling the 
FRB’s appearance in two of 
the telescope’s 13 overlapping 
beams, the team constrained 
the origin of the FRB to just a 
handful of possible locations 
more than 1.5 billion light years 
(0.46 billion parsecs) away. 

Pinpointing a burst’s location 
should reveal its distance and 
allow astronomers to learn 
about the matter between 
galaxies, say the authors. 
Science http://doi.org/bs66 
(2016) 


Rapid recovery 
boosts plant yields 


In strong sunlight, plants give 
off excess energy as heat to 
prevent damage, but when 
the light dims they are slow 
to switch off this process, 
reducing photosynthesis. 
Now researchers have shown 
that increasing the pace at 
which plants resume full 
photosynthesis can boost their 
yields by about 15%. 

Stephen Long of the 
University of Illinois in 
Urbana, Krishna Niyogi of 
the University of California, 


POPULATION GENETICS 


Clues to shy sharks in seawater DNA 


Traces of DNA in seawater can provide 
new insight into populations of the elusive 


whale shark. 


DNA found in the environment (eDNA) 
can come from the dead skin cells and faeces 
of animals. Philip Francis Thomsen at the 
University of Copenhagen and his colleagues 
compared eDNA sequences from seawater 
off the coast of Qatar with mitochondrial 
DNA sampled from a group of whale sharks 
(Rhincodon typus; pictured) in the same 
location. The two sets of DNA showed similar 


Berkeley, and their colleagues 
engineered tobacco plants 
(Nicotiana tabacum) by 
increasing the expression 

of three genes involved in 
protecting plants from bright 
light — PsbS, VDE and ZEP. 
They found that photosynthesis 
recovered more quickly under 
fluctuating light conditions 
than in unengineered plants. 
The dry weight of engineered 
plants grown in the field was 
14-20% higher than that of 
normal plants. 

This could bea way to 
increase the productivity of 
food crops, the authors say. 
Science 354, 857-861 (2016) 
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genetic patterns, but the eDNA included 
extra sequences — suggesting that eDNA may 


reveal greater genetic diversity. Comparing 


authors say. 


Catalyst clicks 
drugs together 


A new type of catalyst 
assembles cancer drugs from 
benign components, and 
can work inside laboratory 
animals. 

Some drug molecules can be 
split into smaller parts that are 
then equipped with specific 
chemical groups. These groups 
allow the components to be 
put back together in a ‘click 
reaction in the presence of a 
catalyst. This could reduce the 
side effects of cancer drugs by 


the eDNA data with those from whale-shark 
groups worldwide showed that the Qatari 
animals are similar to other Indo-Pacific 
groups, which are genetically distinct from 
Atlantic populations. 

Using eDNA is a powerful way to study 
population dynamics in marine animals, the 


Nature Ecol. Evol. 1,0004 (2016) 


rendering them inactive until 
they reach the tumour where 
they are reassembled — a 

step that previously required 
a toxic copper-ion catalyst. 
Mark Bradley of the University 
of Edinburgh, UK, and his 
colleagues developed a non- 
toxic alternative by growing 
copper nanoparticles inside a 
polymer resin. Their catalyst 
assembled an anti-tumour 
drug that killed cancer cells in 
culture. 

The catalyst was also active 
and non-toxic in tests in 
zebrafish embryos. 

Angew. Chem. Int. Edn http://doi. 
org/f3s7ct (2016) 
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BIOCHEMISTRY 


A fast way to 
convert carbon 


Researchers have come up 
with a biochemical pathway 
that converts carbon 
dioxide into useful organic 
compounds more efficiently 
than plants can. 

A team led by Tobias Erb at 
the Max Planck Institute for 
Terrestrial Microbiology in 
Marburg, Germany, assembled 
a series of chemical reactions 
using 17 redesigned enzymes 
from 9 different organisms. 
The main enzyme used by 
plants in photosynthesis 
makes the process of CO, 
fixation relatively slow, but 
using other enzymes from 
bacteria, plants and humans, 
the researchers developed a 
quicker and more energy- 
efficient pathway. 

This process could have 
many applications, such as 
boosting CO, fixation in 
photosynthetic organisms, the 
authors say. 

Science 354, 900-904 (2016) 


Low-fibre diet 
puts gut at risk 


Mice eating a low-fibre diet 
have a higher risk of bowel 
infection, thanks to bacteria 
that normally live in the gut. 
Eric Martens at the 
University of Michigan 
Medical School in Ann Arbor 
and Mahesh Desai, now at 
the Luxembourg Institute of 
Health, together with their 
colleagues, compared the 
effects of fibre-poor and 
fibre-rich diets in mice that 
lacked their own bacteria and 
were given a mix of 14 species 
of human gut bacteria. These 
microbes normally consume 
carbohydrates from dietary 
fibre, but without these 
nutrients, the bacteria instead 
degraded the mucus barrier 
that lines the intestinal wall. 
Thinning of this protective 
layer (pictured as indicated 
by white arrows in the right 
panel; left panel shows 
normal layer) exposed the 


intestinal surface to attack 
by disease-causing bacteria. 
More than half of the mice 
eating a low-fibre diet lost 
at least 20% of their body 
weight after infection with 
the pathogen Citrobacter 
rodentium. 

Cell 167, 1339-1353 (2016) 


DNA reveals 
history of maize 


Maize (corn) that grew more 
than 5,000 years ago was 
genetically more similar to 
today’s maize than to its wild 
counterpart, suggesting that 
the plant was already being 
domesticated. 

Nathan Wales and Jazmin 
Ramos-Madrigal of the 
University of Copenhagen 
and their team sequenced the 
genome of a 5,310-year-old 
maize cob (Zea mays) 
excavated in the 1960s in 
Mexico. They found a gene 
variant responsible for 
producing kernels that lack a 
hard seed coat and are easy to 
eat — probably the first step 
in domestication. However, 
other genes typically found 
in today’s maize were absent, 
including one that prevents 
maize ears from shattering 
when ripe. 

Curr. Biol. http://doi.org/btcs 
(2016) 


‘Mini-guts’ made 
with nerves 


Miniature tissue structures 
resembling a human intestine 
with functional nerves 
can mimic the involuntary 
contractions that push food 
through the gut. 

James Wells and Michael 
Helmrath of the Cincinnati 
Children’s Hospital Medical 


RESEARCH HIGHLIGHTS BiiiSaiiaae 


Center in Ohio and their 
team generated nerve and 
intestinal cells from human 
stem cells. They grew the 

mix for 28 days to make 3D 
structures, or organoids, 

that had intrinsic nerve cells. 
They then transplanted the 
organoids into mice to form 
mature nerves. A single electric 
impulse triggered a prolonged 
wave of contractions in the 
organoids that had a nervous 
system, but did not do so in 
structures lacking nerves. 
Using the model, the group 
identified molecular pathways 
involved in Hirschsprung’s 
disease, a rare disorder of the 
large intestine. 

These organoids could be 
used to study other intestinal 
diseases, the authors say. 
Nature Med. http://dx.doi. 
org/10.1038/nm.4233 (2016) 


ECOLOGY 


Ants farm plants 
in Fiji 
A species of ant in Fiji survives 
by farming six plant species. 
The ants nurture seedlings 
and then live in the cavities of 
the plants. 

Guillaume Chomicki 
and Susanne Renner of the 
University of Munich in 
Germany found that Philidris 
nagasau ants carry seeds of 
six species of Squamellaria 
plant and insert them into 
cracks in trees, where they 
germinate (pictured). The 
ants also fertilize the seeds 
with their waste, and are 
only found living near these 
plants. 

Although ants are known 
to farm fungi, this is the first 
time they have been found 
to plant seeds and actively 
cultivate them. Examining 
the family trees of relatives of 
these ants and plants suggests 
that this mutually dependent 


relationship evolved around 

3 million years ago. 

Nature Plants http://dx.doi.org/ 
10.1038/nplants.2016.181 (2016) 


| GENOMICS 
Epidemics marked 
indigenous DNA 


Epidemics from Europe that 
killed thousands of indigenous 
Canadians in the nineteenth 
century have left their 
signatures in the genomes of 
people in Canada today. 

Ripan Malhi at the 
University of Illinois in 
Urbana-Champaign, Michael 
DeGiorgio at Pennsylvania 
State University in University 
Park and their colleagues 
sequenced the genomes of 
25 modern Tsimshian people, 
who live in coastal British 
Columbia and Alaska and are 
among Canada’s First Nations. 
The authors compared the 
genomes with the DNA of 
25 people who lived in the same 
region between 6,000 and 1,000 
years ago. They found that 
most of the modern Tsimshian 
people had different versions 
of immune-system genes, 
including an HLA gene, from 
their ancestors. Moreover, the 
team found an overall decrease 
in genetic diversity, which 
probably signifies a sudden 
population decline of about 
57% around 175 years ago. 

This is close to historical 
accounts that suggested a 70% 
decrease due to a smallpox 
epidemic. 

Nature Commun. 7, 13175 (2016) 
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HEALTH 


Zika virus trial 

A trial that would release 
genetically modified 
mosquitoes in Florida to fight 
diseases such as the Zika virus 
can go ahead, the Florida Keys 
Mosquito Control District 
decided on 19 November. The 
initiative had been among 
those voted on by the state 

in non-binding referenda in 
the 8 November US election. 
The town of Key Haven, the 
proposed location of the 

trial, rejected the measure, 
but voters elsewhere in the 
broader area of Monroe 
County approved it. The 
mosquito board has the 
ultimate say. The experiment, 
which would be a US first, 
needs a new site and approval 
from the US Food and Drug 
Administration before it 

can begin. Separately, the 
World Health Organization 
announced on 18 November 
that the Zika virus no longer 
presents a health emergency of 
global concern. 


Malaria vaccine 


Vaccinations against malaria 
will begin in sub-Saharan 
Africa in 2018, the World 
Health Organization 
announced on 17 November. 
A handful of pilot projects 
will use the first ever malaria 
vaccine — called RT'S,S — 
against the world’s deadliest 
malaria parasite, Plasmodium 
falciparum. The projects 

will test the logistics of 
administering a vaccine that 
must be given to children 

4 times over 18 months, 

and that yields only partial 
protection. 


CRISPR in humans 


A Chinese team of researchers 
has become the first to inject 

a person with cells modified 
using the CRISPR-Cas9 gene- 
editing technique, with the 
aim of treating aggressive lung 


Peat initiative targets climate and health 


Peatlands cover only 3% of the global land 
surface, but contain about twice as much carbon 
as the entire biomass of the world’s forests. A 
global initiative launched on 17 November at the 
United Nations climate meeting in Marrakesh, 
Morocco, seeks support from governments and 
international organizations to scale up protection 


cancer. The participant is part 
of a clinical trial at the West 
China Hospital in Chengdu. 
See page 479 for more. 


POLICY 


Brexit and science 


An influential group of 
politicians has urged the UK 
government to guarantee that 
all European Union researchers 
living in the country can stay 
after it leaves the union. In 

a report on the impacts of 
Brexit on science, released 

on 18 November, the House 

of Commons science and 
technology select committee 
advised the government to set 
out a broader vision for science 
— including committing to 

a hefty raise in the country’s 
science spending. The 
comunittee also said that the 
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department in charge of Brexit 
should move swiftly to appoint 
a chief scientific adviser, a 
position currently unfilled 
and unadvertised. Separately, 
Prime Minister Theresa May 
announced on 21 November 
that her government would 
invest more in research and 
development — providing 
increases worth £2 billion 
(US$2.5 billion) per year by 
2020 — to keep Britain at the 
“cutting edge” 


Women only 

Ina bid to increase the 
proportion of women in high- 
ranking positions in academia, 
the Royal Netherlands 
Academy of Arts and Sciences 
will hold two special elections 
in 2017 to recruit 16 additional 
female scholars and scientists. 
Nominations for the first round 


of the organic carbon stock and restoration of 
drained wetlands. Drained and burned peatlands 
are responsible for about 5% of carbon emissions 
from human activity. Peat forest fires last year 

in Indonesia (pictured), one of the founding 
members of the initiative, have been linked to 
100,000 premature deaths in the region. 


of women-only elections, 

the results of which are to be 
announced in May, are open 
until 8 December. Currently 
only 13% of the academy's 
556 members are women. 
Normal annual elections for 
16 new members, male or 
female, will continue in parallel 
with the two extra rounds, the 
academy says. 


Climate concern 
Australia, Canada and the 
United States all risk breaking 
the climate pledges that they 
made in Paris a year ago, 
according to an analysis of 
past and present action by 

the world’s largest national 
economies. The study by the 
London School of Economics 
and Political Science, published 
on 17 November at the United 
Nations climate meeting in 
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> Marrakesh, Morocco, finds that 
6 Argentina, Saudi Arabia and 
Turkey are likewise behind on 
meeting their climate goals. 
Only France, Germany and the 
United Kingdom are curbing 
their domestic greenhouse-gas 
emissions in line with their 
Paris targets; Brazil, China and 
Italy look set to almost keep 
their pledges, the authors say. 


Super satellite 


The Geostationary Operational 
Environmental Satellite-R 
Series (GOES-R), the United 
States’ most scientifically 
capable weather satellite 

yet, successfully launched 

on 19 November. GOES-R 

can take pictures every 30 
seconds, which is 5 times faster 
than current GOES satellites 
can, and in 4 times higher 
resolution. The rapid imaging 
will allow it to track severe 
storms, wildfire smoke and 
volcanic ash as they spread. 
GOES-R should be operational 
within a year. 


Quadruple launch 


A quartet of Galileo satellites 
successfully launched on an 
Ariane 5 rocket (pictured) 
from French Guiana on 

17 November, bringing 
Europe’ civil global positioning 
system close to completion. 
The 18 Galileo satellites now in 
orbit should allow the system 
to enter into preliminary 
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service by the end of the year. 
By 2020, a full constellation 

of 24 satellites along 3 orbital 
planes — plus 2 spares per 
plane — are to routinely 
transmit their position and the 
time, allowing users worldwide 
to locate objects on Earth with 


is endemic. The bacterium 
was first identified in Europe 
in 2013. 


ITER creeps on 


The council of the nuclear- 
fusion experiment ITER 
endorsed a radical change 

of plans on 17 November: 

the international project will 
now aim to store hydrogen 
plasma in its doughnut-shaped 
container by 2025, postponing 
full nuclear fusion until 2035. 
The plan was proposed in 2015 
by director-general Bernard 
Bigot in an attempt to save the 
seriously late and over-budget 
project. In spring 2016, two 
external reviews expressed 
cautious optimism about 
Bigot’s management changes 


extreme precision. and the pace of construction at 
the site in St-Paul-lez-Durance, 

Plant pathogen France. 

The deadly fastidiosa 

subspecies of the plant 


pathogen Xylella fastidiosa has 
been identified on the island of 
Mallorca, Spain. It is the first 
appearance of this subspecies 
in outside environments in 
Europe; in April, it had been 
discovered in oleanders in a 
greenhouse in Germany. The 
subspecies is different from 
that responsible for the recent 
devastation of olive groves in 
Puglia, Italy — but the same 
as that which wreaked havoc 
last century in grapevines in 
California, where X. fastidiosa 


John Maddox prize 
Human-memory researcher 
Elizabeth Loftus has been 
awarded the 2016 John 
Maddox Prize. Loftus, at the 
University of California, Irvine, 
has worked on the cognitive 
psychology of false memories 
and unreliable eye-witness 
testimony. She has influenced 
the US legal system, suffered 
severe harassment in reaction 
to her work and highlighted 
the vulnerability of academics 


CLIMATE PLEDGES FALL SHORT OF 2°C TARGET 


2016; GO.NATURE.COM/2F27ZGO 


SOURCE: IEA. WORLD ENERGY OUTLOOK 


TREND WATCH 


The carbon budget to keep global 
warming below 2°C could be used 
up by 2040 under current climate 
policies, says the International 
Energy Agency in its latest World 
Energy Outlook. The report makes 
projections of global energy- 
related carbon dioxide emissions 
up to 2040 under three scenarios 
(see chart). Enforcing pledges in 
the 2015 Paris climate deal offers 
only a few years respite. An early 
peak and subsequent decline in 
emissions in line with meeting the 
2°C target would require extra 
efforts, the agency says. 


Current and probable policies to curb carbon emissions will fail to limit 


warming to 2°C, according to the 2016 World Energy Outlook. 


Global energy-related 
COz emissions (billion tonnes) 


{) Current-policies scenario {J Intended-policies scenario 
i 2°C scenario 


NO 
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2014 2020 2030 


Current policies are those enacted as of mid-2016; 
intended scenario includes announced policies that will probably be enacted. 
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SEVEN DAYS | THIS WEEK | 


28 NOVEMBER- 

2 DECEMBER 

Dark matter and 
gravitational waves are 
top of the agenda at 

the 13th International 
Symposium on 
Cosmology and Particle 
Astrophysics in Sydney, 
Australia. 
go.nature.com/2gcOluo 


1-2 DECEMBER 
Delegates from Canada 
and the 22 member 
states of the European 
Space Agency meet in 
Lucerne, Switzerland, 
to discuss the agency's 
focus for the coming 
years. 
go.nature.com/2f7wmuk 


in the face of legal threats. The 
John Maddox Prize is awarded 
annually for championing 
sound science and evidence 

in the face of hostility and 
repression. It is sponsored 

by Nature and the London- 
based charities the Kohn 
Foundation and Sense About 
Science. It is named after the 
late John Maddox, a former 
editor of Nature. See go.nature. 
com/2gdrxxo for more. 


Texas tea 


A massive deposit of untapped 
hydrocarbon deposits, 
thought to be the largest ever 
in the United States, has been 
discovered in western Texas. 
The Midland Basin of the 
Wolfcamp Shale area contains 
an estimated 20 billion barrels 
of oil and 1.6 billion barrels of 
natural gas, the US Geological 
Survey said on 15 November. 
The agency has not assessed. 
whether it would be profitable 
to produce the ‘unconventional’ 
resources, which would have 
to be extracted using special 
methods. 


> NATURE.COM 
For daily news updates see: 
WwW.nature.com/news 
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The bizarre 


Minority scientists CRISPR gene- Researchers fight = @ 
rethink careers in wake of editing moves into the clinic to avert catastrophic chromosome evolution 
Trump win p.476 in China p.479 funding freeze p.480 ag of acommon bird p.482 


The SESAME accelerator will circulate electrons at high energies, creating beams of intense radiation that will be used to study cultural treasures. 


Middle Eastern X-ray 
factory readies for action 


SESAME project set to revolutionize science in the region, but is strapped for cash. 


BY ELIZABETH GIBNEY 


r The Middle East’s first major interna- 
tional research centre has weathered 
political unrest, international sanctions 

and even the assassination of two delegates. 

Now, the Synchrotron-light for Experimental 

Science and Applications in the Middle East, or 

SESAME, is on the brink of circulating its first 

subatomic particles. 

The machine, which lies outside Amman, 


Jordan, will start accelerating electrons around 
its 133-metre ring in December and begin 
using the resulting beams of intense radiation 
to do science from May 2017. 

Strapped for cash, SESAME will initially 
operate at half its planned capacity. But its 
location in the Middle East means that it will 
make a significant contribution to science. “It 
will set the stage for a culture of state-of-the- 
art technology and science in our region,’ says 
director Khaled Toukan. 


SESAME is a collaboration between Bahrain, 
Cyprus, Egypt, Iran, Israel, Jordan, Pakistan, 
the Palestinian Authority and Turkey. Created 
under the auspices of the United Nations Edu- 
cational, Scientific and Cultural Organization 
(UNESCO), it has also received funding from 
other countries and from the European Union. 

The machine will generate light in a range of 
wavelengths and then channel it into intense 
‘beamlines’ of specific frequencies (see ‘Open 
SESAME’). These can be used to reveal the 
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OPEN SESAME 


The SESAME synchrotron near Amman, 
Jordan, will start circulating beams in 
December. But it will initially use only 

a fraction of its potential capacity. 


MICROTRON 
Electrons accelerated to 
22 megaelectronvolts. 


BOOSTER RING 
Electrons further accelerated to 
800 megaelectronvolts. 


STORAGE RING 
Electrons circulate at 
2.5 gigaelectronvolts, and start to emit light. 


BEAMLINES 
Optical instruments tailor the light 
and use it to investigate samples. 


< 


> structure of materials, biological samples and 
artefacts down to the atomic scale. SESAME has 
space for 24 beamlines. The plan was initially 
to open with four of the slots filled, but a lack of 
funding means that it will open with just two — 
one infrared and one X-ray. 

“SESAME is ina part of the world where you 
have very difficult times politically, and a lot 
of instability, and the money just didn’t come,” 
says Toukan. Iran couldn't pay its contribution 
until January, owing to sanctions that prevented 
it from transferring money internationally. 
Cyprus has not paid its share of the running 
costs since it was hit by a financial crisis in 
2011, says Toukan, and Pakistan has paid only 
half of its dues this year. Jordan and Turkey are 
the only countries to have paid their annual and 
capital contributions in full, he says. Politics has 
touched SESAME, too: two Iranian physicists 
killed in car-bomb attacks in 2009 and 2010 
were members of its council. 

Still, engineers are expected to have finished 
testing the synchrotron by May, anda group of 
260 researchers, largely from universities in the 
Middle East and in fields from pharmacology 


Booster 


Metres in cireu® 


Just two beamlines 
will be active when 
SESAME opens. 


Beamlines 
ring 


Storage 


SESAME has space 
for 24 beamlines. 


Two more beamlines 
likely to get funding 


&@ 
ow 


to physics, will then be able to apply for time on 
the two beamlines. 

Toukan hopes that the facility will ease brain 
drain from the region, boost collaboration 
between the scientific community and indus- 
try and create opportunities for researchers who 
lack the funds to leave the Middle East. 

Because of its proximity to many of the 
world’s archaeological treasures, the facility 
presents a fresh opportunity for studying the 
region’s cultural heritage. Jan Gunneweg, an 
archaeometrist at the Hebrew University of 
Jerusalem, hopes that scientists will use SES- 
AME to collaborate on understanding their 
shared history. Many artefacts — such as Egyp- 
tian mummies and delicate papyrus — are frag- 
ile and must be insured at high cost if they are to 
travel long distances. “If that material has to go 
in the air, you destroy it,” says Gunneweg. 

He wants to use the synchrotron to further 
his studies of the composition, and therefore 
origins, of parchments including the Dead Sea 
Scrolls, the oldest known biblical texts. 

SESAME cost around US$110 million to 
build, just one-sixth of the price of the European 


Synchrotron Radiation Facility in Grenoble, 
France, one of the world’s most advanced syn- 
chrotrons. Pared-back ambition helped, but 
SESAME also relied on parts donated from 
dismantled European facilities, and was built 
on land that was given for free. 

Two more beamlines are due to be installed 
by the end of 2018. SESAME has secured $2 mil- 
lion from the Jordanian Scientific Research 
Fund for its third beam, which will perform 
protein crystallography. And Toukan is confi- 
dent that the collaboration will find funds for 
a fourth beam dedicated to materials science. 

Beyond these four beams, there is no set 
schedule for filling the remaining 20 slots. “A 
few tens of million of dollars could make this 
dream come true,’ says Roy Beck, a biophysicist 
at Tel Aviv University in Israel and a commit- 
tee member of the SESAME users’ group. He 
laments that more nations have not been willing 
to swallow national differences for the sake of 
science. Some Gulf countries will not take part 
because of Israel's participation, he says, and the 
United States has made only a small contribu- 
tion, which both Beck and Toukan attribute to 
political considerations (see page 468). 

Advocates hope that SESAME will foster 
peace in the same way that CERN, Europe’s 
particle-physics laboratory, near Geneva, Swit- 
zerland, helped to heal the wounds of the Sec- 
ond World War and brought Soviet and Western 
scientists together at the height of the cold war. 

But crucial factors in that success were 
CERN’s communal spaces, where scientists 
could share a coffee and get to know each 
other, says Beck. SESAME is scheduled to open 
without a cafeteria or dedicated accommoda- 
tion, although a committee is trying to raise 
US$32,000 in donations to create the former. 

“T hope people from all round will under- 
stand that this is a true chance for people within 
the Middle East to join hands and talk about 
things that unite them,” says Beck. m 


Immigrant and minority 
scientists shaken by Trump win 


Worries include job prospects, discrimination — and safety. 


BY HEIDI LEDFORD, SARA REARDON 
& RAMIN SKIBBA 


s the US presidential election results 
rolled in, Naglaa Shoukry watched 
a door slam shut. An immunologist 


at the University of Montreal in Canada, she 
had been contemplating a move to the United 


States in search of better research funding. But 
when Donald Trump clinched the presidency, 
she knew that she would probably not go. 
Shoukry, a Muslim from Egypt, did post- 
doctoral research in Ohio and was there 
when the United States tightened security 
after the terrorist attacks of 11 September 
2001. When Trump pledged to use “extreme 
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vetting” to determine which immigrants 
could enter the country, Shoukry recalled the 
humiliations her family experienced when 
travelling to see her in Ohio. “You have an 
interesting name,’ a US border official once 
told her brother, Mohamed, before detaining 
him for extra security checks. Under Trump, 
Shoukry decided, it would surely be worse. 


SCOTT EISEN/GETTY 


Shoukry is not the only scientist shaken by 
the result of the 8 November election, nor is 
she alone in reconsidering whether to work or 
study in the United States. Trump's campaign 
rhetoric was at times insulting to women, 
immigrants and under-represented minorities, 
and there are signs that his victory has further 
inflamed racial tensions. The Southern Poverty 
Law Center, a civil-rights advocacy group in 
Montgomery, Alabama, collected 437 reports 
of intimidation and harassment in the five days 
after the election — many of which explicitly 
referenced president-elect Trump. Fifteen per 
cent of the incidents reported to the centre 
took place at universities. 

Such events have led several universities and 
scientific societies to reaffirm their commit- 
ment to diversity. “Take concrete steps to pro- 
tect and advocate for colleagues and students 
who are particularly vulnerable right now,’ 
urges a letter signed by ten astronomers on the 
Astronomy in Color blog, which advocates for 
diversity in the field. “We must reiterate how 
absolutely essential it is to the core values of 
our community, and also to the well-being 
of our society and world, that all persons be 
treated with the dignity and respect they 
deserve,” wrote Amy Gutmann, president of 
the University of Pennsylvania in Philadelphia, 
when black undergraduates there received 
racist e-mails after the election. “We all stand 
together in solidarity with our Black students.” 

Still, scientists around the country have 
reported harassment. Monica Felit-Mojer, a 
science communicator originally from Puerto 
Rico, is nervous about speaking Spanish in 
some public places after hearing a passer-by 
shout “Build that wall!” — referencing Trump’s 
plan to build a wall along the US-Mexico 
border. The 12 November incident took place 
in San Diego, California, where Felit-Modjer 
was attending the Society for Neuroscience 
annual meeting. 

Patrick Freeman, an ecologist at the 
Carnegie Institution for Science in Stanford, 
California, was out driving on the day after 
the election when a person in a truck with a 
Trump bumper sticker started honking at 
him — perhaps because Freeman’s car sports 
a sticker in support of Trump’s Democratic 
opponent, Hillary Clinton, and of lesbian, 
gay, bisexual, transgender and queer people. 
The truck driver pulled up alongside Freeman 
and repeatedly pantomimed shooting him 
before speeding away. 

For some scientists, the post-election 
atmosphere has been stifling. Nicole Cabrera 
Salazar, an astronomy graduate student at 
Georgia State University in Atlanta who was 
born in Chile, says she is mindful that her field 
is dominated by white males — and that 63% 
of white males who voted supported Trump. 
Cabrera Salazar was once outspoken in support 
of marginalized students, but she has begun to 
hold back for fear of a backlash. “It’s a toxic 
environment for women of colour,’ she says. 


IN FOCUS 


Donald Trump has taken a hard-line stance on US immigration. 


Some foreign researchers even question 
whether to travel to the United States for con- 
ferences. Shaaban Mousa, an anaesthesiolo- 
gist at Charité University Hospital in Berlin, 
comes to the Society for Neuroscience meet- 
ing every year, but considered skipping it this 
year because of the election result. “I was not 
relaxed,” he said last week at the meeting. “I 
came because I have my ticket.” And Razi 
Nalim, an engineer at Indiana University- 
Purdue University in Indianapolis, worries that 
he'll have difficulty recruiting Muslim students 

and postdocs to work 
in his department. 
About 2% of sci- 
entists in the United 
States are Muslim 
immigrants, accord- 
ing toa survey of biol- 
ogists and physicists 
led by Elaine Howard 
Ecklund, a sociologist 
at Rice University in Houston, Texas. And 64% 
of them say they have experienced religious 
discrimination — a higher proportion than for 
any other racial or religious group in Ecklund’s 
study. The US Federal Bureau of Investigation’s 
annual hate-crimes report suggests that num- 
ber may grow: hate crimes against Muslims in 
the United States climbed by 67% last year. 

But several Muslim scientists contacted 
by Nature still feel comfortable in the United 
States. Some, such as a husband and wife stud- 
ying neuroscience at the Medical University 
of South Carolina in Charleston, were pleased 


to see Trump win. “It’s a good change for the 
US,” says the husband, who asked not to be 
named over concerns with his career. He and 
his wife say that they have never experienced 
racism or Islamophobia during their time in 
South Carolina — a state where Trump won 
55% of the vote — and they have been shocked 
by mass e-mails from their professors calling 
Trump supporters racist and hateful. 

Shoukry says that some of her Muslim 
colleagues in the United States are troubled by 
the thought of raising children in an increas- 
ingly hostile environment. Islam Hussein, an 
Egyptian virologist who is a research scientist 
at the Massachusetts Institute of Technology 
in Cambridge, says his children were nervous 
about going to school the day after the elec- 
tion. But after a decade in the United States, 
Hussein wants to stay. His family has always 
felt welcome, he says, and his wife, who wears 
a hijab, has a flourishing career as a pre-school 
teacher. After the election, friends rushed to 
offer the family support. “This is the spirit of 
America that we should all keep embracing,” 
Hussein told Nature on 14 November. 

The next day, Hussein contacted Nature 
again: his wife had just been accosted at their 
local pharmacy. “Why are you still here?” a 
man shrieked at her, then hopped into his car 
and fled. “I think he meant: “Why are you still 
here after Trump’s election??’ says Hussein. 
“This is the first explicit act of discrimination 
we have encountered in ten years.” = 


Additional reporting by Erika Check Hayden. 
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Asteroid crater probed 


Drilling into the Mexican site of the dinosaur-killing impact reveals how enormous 
craters form and finds shattered rock where underground life could thrive. 


BY ALEXANDRA WITZE 


rilling into ground zero of the asteroid 
D impact that killed off the dinosaurs 
66 million years ago has uncovered 
the origin of its mysterious ring of mountains. 

The drill core penetrated a circle of 
mountains, known as a peak ring, in Mexi- 
cos buried Chicxulub crater. Only the largest 
impacts are powerful enough to create peak 
rings. Understanding how these mountains 
formed at the 200-kilometre-wide Chicxulub 
could help to reveal how cosmic collisions 
shaped other bodies, such as the Moon and 
Venus. 

The work also shows how, despite killing 
the dinosaurs, the impact may have created an 
environment in which other life could thrive. 
The cosmic smash-up fractured rocks, open- 
ing up spaces and warm habitats for microbes 
to move in. 

“We got a core better than we could 
reasonably have imagined,” says David Kring, 
a geologist at the Lunar and Planetary Institute 
in Houston, Texas. “It is absolutely extra- 
ordinary.” He and his colleagues described the 
findings on 18 November in Science’. 

After a decade of planning, the project pen- 
etrated 1,335 metres into the sea floor off the 
coast of Progreso, Mexico, in April and May. 
Drillers hit the first peak-ring rocks at a depth 
of 618 metres, and a pinkish granite at 748 
metres. 

Geologists know that the granite must have 
come from relatively deep in the crust — 
perhaps 8-10 kilometres down — because it 
contains big crystals. The size of these crystals 
suggests that they formed through the slow 
cooling of deep, molten rock; by contrast, rapid 
cooling at shallow depth tends to form small 
crystals. Finding the granite relatively high in 
the drill core means that something must have 
lifted it up and then thrown it down on top of 
other rocks. 

That rules out one idea of how craters form, 
in which the pulverized rock stays mostly in 
place like hot soup in a bowl. Instead, the core 
confirms the ‘dynamic collapse’ model of cos- 
mic impacts, in which the asteroid punches 
a deep hole in the crust, causing the rock to 
flow like a liquid and spurt skyward. That rock 
then falls back to Earth, splattering around in 
a peak ring. 

“The ring of mountains we observe is 
made of deep material uplifted and flipped 


2] 


Acore from the Chicxulub crater contains pinkish granite that hints at how the basin formed. 


on its head,” says Sean Gulick, a geophysicist 
at the University of Texas at Austin. Gulick 
co-led the Chicxulub expedition with Joanna 
Morgan, a geophysicist at Imperial College 
London. 


FREE SPACE 

Knowing that Chicxulub formed through 

dynamic collapse opens new ways to explore 

other worlds, says Kring. Last month, he 
and his colleagues 


“We gota showed that, like 
core better Chicxulub, the peak 
than we could ring of the Moon’s 
reasonably have  320-kilometre-wide 


Schrédinger crater 
is made of material 
from deep in the crust’. This means that robots 
or astronauts could visit Schrodinger to pick 
up samples of the lunar interior. 

Chicxulub is the only good example of a 
peak-ring crater on Earth. Studying peak 
rings in impact basins on other worlds could 
confirm whether the dynamic-collapse 
model holds true for different environ- 
ments, says David Baker, a planetary scien- 
tist at NASA’s Goddard Space Flight Center 
in Greenbelt, Maryland, who has published 


imagined.” 
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on peak-ring formation’. 

The Chicxulub core also explains a long- 
standing geophysical mystery — why seismic 
waves move surprisingly slowly in the peak- 
ring rocks. The answer is that the impact 
fractured the granite so badly that it became 
much less dense than typical granite, slowing 
the waves’ progress. 

Lower density also means that rock is more 
porous, providing space for microbes to move 
around inside. After the sterilizing heat of the 
Chicxulub impact had dissipated, organisms 
may have taken advantage of the lingering 
warmth. “Somehow, life finds its way into 
these high-porosity targets and takes advan- 
tage of the habitat created, says Gulick. An 
underground biosphere could have evolved 
over millions of years, surviving off chemical 
energy and warmth without sunlight. 

Project scientists have found cells and 
microbial DNA in the Chicxulub core, but are 
holding details for future publications. = 


1. Morgan, J. V. et al. Science 354, 878-882 (2016). 
2. Kring, D.A., Kramer, G. Y., Collins, G. S., 
Potter, R. W. K. & Chandnani, M. Nature Commun. 7, 
13161 (2016). 
3. Baker, D. M. H., Head, J. W., Collins, G.S. & 
Potter, R. W. K. Icarus 273, 146-163 (2016). 
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Gene editing could improve the ability of immune cells (spherical) to attack cancer. 


CRISPR gene editing 
tested in a person 


Trial could spark biomedical duel between China and US. 


BY DAVID CYRANOSKI 


Chinese group has become the first to 
A: ect a person with cells that contain 

genes edited using the revolutionary 
CRISPR-Cas9 technique. 

On 28 October, a team led by oncologist Lu 
You at Sichuan University in Chengdu deliv- 
ered the modified cells into a patient with 
aggressive lung cancer as part of a clinical trial 
at the West China Hospital, also in Chengdu. 

Earlier clinical trials using cells edited with 
a different technique have shown promise at 
treating disease. The emergence of CRISPR, 
which is simpler and more efficient than 
other techniques, will probably accelerate the 
race to get gene-edited cells into the clinic, 
says Carl June, who specializes in immuno- 
therapy at the University of Pennsylvania in 
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Philadelphia and led one of the earlier trials. 

“T think this is going to trigger ‘Sputnik 2.0, 
a biomedical duel on progress between China 
and the United States, which is important 
since competition usually improves the end 
product,” he says. 

June is also the scientific adviser for a 
planned US trial that will use CRISPR to target 
three genes in cells extracted from participants, 
with the goal of treating various cancers. He 
expects the trial to start early next year. In 
March 2017, a group at Peking University 
in Beijing hopes to start three clinical trials 
using CRISPR against bladder, prostate and 
renal-cell cancers. Those trials do not yet have 
approval or funding. 

Lu’s trial received ethical approval from a 
hospital review board in July. Injections into 
participants were supposed to begin in August 


Long-sought @ The maths of science’s broken 
signal deepens | reward system 

mystery of @ Laser used to control mouse’s 
fast radio brain — and speed up milkshake 
bursts consumption 


Nations’ DNA 


@ European diseases left mark on First 


IN FOCUS 


but the date was pushed back, Lu says, because 
culturing and amplifying the cells took longer 
than expected and then the team ran into 
China’s October holidays. 

The researchers removed immune cells 
from the recipient’s blood and then disabled 
a gene in them using CRISPR-Cas9, which 
combines a DNA-cutting enzyme with a 
molecular guide that can be programmed to 
tell the enzyme precisely where to cut. The dis- 
abled gene codes for the protein PD-1, which 
normally puts the brakes on a cell’s immune 
response: cancers take advantage of that func- 
tion to proliferate. 

Lu’s team then cultured the edited cells, 
increasing their number, and injected them 
back into the patient, who has metastatic non- 
small-cell lung cancer. The hope is that, with- 
out PD-1, the edited cells will attack and defeat 
the cancer. 

Lu says that the treatment went smoothly, 
and that the participant will get a second injec- 
tion, but declined to give details because of 
patient confidentiality. The team plans to treat 
ten people in total; each will receive either two, 
three or four injections. It is primarily a safety 
trial, and participants will be monitored for six 
months to determine whether the injections 
are causing serious adverse effects. Lu’s team 
will also watch them beyond that time to see if 
they seem to be benefiting from the treatment. 

Other oncologists are excited about 
CRISPR’s entry onto the cancer scene. “The 
technology to be able to do this is incredible,” 
says Naiyer Rizvi of Columbia University 
Medical Center in New York City. Antonio 
Russo of Palermo University in Italy notes that 
antibodies that neutralize PD-1 have success- 
fully kept lung cancer in check, boding well for 
a CRISPR-enabled attack on the protein. “It’s 
an exciting strategy,” he says. “The rationale is 
strong.” 

But Rizvi questions whether this particular 
trial will succeed. The process of extracting, 
genetically modifying and multiplying cells 
is “a huge undertaking and not very scalable’, 
he says. “Unless it shows a large gain in 
efficacy, it will be hard to justify moving for- 
ward.” He doubts it will be superior to the 
use of antibodies, which can be expanded to 
unlimited quantities in the clinic. Lu says that 
this question is being evaluated in the trial, 
but that it’s too early to say which approach 
is better. = 
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Brazilian President Michel Temer wants to freeze real-terms federal spending for the next two decades. 


Brazil’s scientists 
fight funding freeze 


Proposed law could restrict research spending for 20 years. 


BY CLAUDIO ANGELO 


razil’s science budget has shrunk by 
B more than 40% in the past 3 years. But 

researchers are now trying to escape 
an even worse crisis: federal science spend- 
ing could be frozen at its current low level for 
the next two decades, under a constitutional 
amendment to cap public spending to infla- 
tion-level rises. 

On 16 November, 19 institutions repre- 
senting higher education and research sent 
a letter to Brazil’s Senate asking that politi- 
cians exclude science and education from 
the amendment, which is President Michel 
Temer’s top legislative priority as he aims to 
halt the country’s spiralling fiscal crisis. The 
amendment passed the lower house of Brazil’s 
National Congress last month; the Senate is 
due to vote on it by the end of this month. 

If Temer’s legislation does end up constrict- 
ing federal science spending, “it will be the 
end of science in Brazil’, warns physicist Luiz 
Davidovich, who is president of the Brazilian 
Academy of Sciences in Rio de Janeiro, which 
co-authored the letter. 

Last month, scientists joined protests in 
Rio and other cities against the amendment, 
which they called the #PECdoFimDoMundo 
(amendment of the end of the world). 

Brazil’s public spending does need to be 
reined in, says Carlos Américo Pacheco, 
chief executive of the Sao Paulo Research 


Foundation, a state science-funding agency. 
“It's a bitter medicine, but it’s inevitable: you 
wont be able to fix public finance without a 
major effort to cut expenses,” he says. 

But scientists say that they've had enough 
pain. In 2013, the science-ministry budget 
stood at 7.9 billion reais (US$2.3 billion). After 

an economic slump, 

the budget has now 

fallen to 4.6 billion 

reais — the low- 

est level in 7 years, 

and the lowest in a 
decade when inflation is accounted for. And 
that includes the budget of the communica- 
tions ministry, which Temer merged with 
the science ministry to create the Ministry of 
Science, Technology, Innovations and Com- 
munications (MCTIC) when he took office 
in May. 


DRAMATIC MOVES 
Asa result of the cuts, some Brazilian scien- 
tists have left the country; others are making 
do on less funding. And officials at major 
planned facilities such as the 850-million-real 
Multipurpose Research Reactor outside Sao 
Paulo — which aims to generate medical radi- 
oisotopes — and the 510-million-real Sirius 
synchrotron, in Campinas, have complained 
that they haven't yet received their 2016 funds. 
Freezing the science budget at these low 
levels could be disastrous, says Helena Nader, 
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president of the Brazilian Society for the 
Advancement of Science. Federal science 
spending is currently around 1.1% of Bra- 
zil’s gross domestic product (GDP), but that 
figure will fall if spending is frozen and GDP 
grows over the coming years. Nader compares 
that with science spending in China, which is 
planned to grow from 2.1% of GDP now to 
2.5% in 2020. 

The MCTIC says that the amendment will 
not necessarily affect the science budget. 
Science might even get more money — pro- 
vided that other areas of federal funding get 
less. But critics say that’s unlikely in a country 
with other pressing needs, including chroni- 
cally underfunded public education and 
health and a collapsing pensions system. 


NEXT YEAR’S BUDGET 

MCTIC minister Gilberto Kassab is trying 
to increase the science budget ahead of the 
amendment. His 2017 request, included in 
a government proposal that has not yet been 
voted on by Congress, would increase the 
budget by 28% to 5.9 billion reais. 

“The MCTIC has managed to include in the 
annual budget a value that’s enough to meet 
the needs of the sector,’ a spokesperson told 
Nature. 

Davidovich disagrees. He says that, after 
accounting for inflation, research needs 
at least 10 billion reais to match the federal 
science spending levels of three years ago — so 
it is well under par. 

To scientists’ further dismay, the 
National Council for Scientific and Tech- 
nological Development — a federal funding 
agency — has been further demoted within 
the new structure of the MCTIC. The council 
used to be an independent body whose presi- 
dent would report directly to the science min- 
ister; now it and other institutions such as the 
Brazilian Space Agency will report to a direc- 
tor who in turn will report to a vice-minister 
for science. “We [science] have been shoved to 
the fourth level,” says Nader. 

The political downgrading will make it 
harder for the council to fight for resources, 
say Nader and Davidovich. Last month, the 
Brazilian Society for the Advancement of 
Science and the Brazilian Academy of Sciences 
delivered a joint letter to Kassab protesting 
against the move. The minister said that he 
would review the changes in January. 

For now, Brazilian science faces a cloud of 
uncertainty. As researchers have asked, the 
amendment could be altered to spare sensi- 
tive areas such as education and research. 
Senators might also try to spare a specific 
slice of the MCTIC budget that comes from 
the revenue of some businesses (such as 
oil and telecommunications), Davidovich 
thinks. It is hard to predict how the amend- 
ment might unfold in the Senate, he says. 
“But then again, someone has said that, in 
Brazil, even the past is unpredictable.” = 


Geneticists seek keys 
to bat vocalizations 


Bat 1K project will sequence DNA of more than 1,000 species. 


BY RAMIN SKIBBA IN SAN DIEGO, 
CALIFORNIA 


But how they learn their calls and melo- 

dies is a mystery — one that scientists will 
try to solve by sequencing the genomes of more 
than 1,000 bat species. 

The project, called Bat 1K, was announced 
on 14 November at the annual meeting of the 
Society for Neuroscience in San Diego, Cali- 
fornia. Its organizers also hope to learn more 
about the flying mammals’ ability to navigate 
in the dark through echolocation; their strong 
immune systems that can shrug off Ebola; and 
their relatively long lifespans. 

“The genomes of all these other species, 
like birds and mice, are well-understood,” 
says Sonja Vernes, a neurogeneticist at the 


S ome bats sing just as birds and humans do. 


Max Planck Institute for Psycholinguistics in 
Nijmegen, the Netherlands, and co-director of 
the project. “But we don't know anything about 
bat genes yet.” 

Some bats show babbling behaviour, 
including barks, chatter, screeches, whis- 
tles and trills, says Mirjam Knérnschild, a 
behavioural ecologist at the Free University of 
Berlin. Young bats learn the songs and other 
sounds from older male tutors. They use these 
sounds during courtship and mating, when 
they retrieve food and as they defend their 
territory against rivals. 

Scientists have studied the vocal sounds 
of only about 50 bat species so far, Knérn- 
schild says, and they know much less about 
bats’ communication than birds’ Four species 
of bat have so far been found to learn vocal 
sounds from each other, their fathers and 
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other adult males, just as a child gradually 
learns how to speak from its parents (S. C. 
Vernes Psychon. Bull. Rev. http://doi.org/btcf; 
2016). 


SINGING LESSONS 

Genetic studies have identified at least one 
gene in bats that is linked to speech and lan- 
guage, called FOXP2 (M. Knornschild Curr. 
Opin. Neurobiol. 28, 80-85; 2014). The gene 
is also known to have a role in how people 
learn language, and in vocal learning in song- 
birds. Researchers working on the Bat 1K pro- 
ject expect to find that other genes are also 
involved in communication, and that many 
more bat species have the ability to learn 
songs, calls and other sounds. “It’s not a rare 
trait,” Knérnschild says. “I'm becoming con- 
vinced that there's a whole continuum in bat 
vocal learning” 

Bats’ echolocation ability has been studied 
for many years, partly because of its applica- 
tions to sonar and radar. But scientists know 
very little about the acoustic communication 
and social behaviour that drive how bats learn 
their songs and sounds, says Michael Yartsev, 
a neurobiologist at University of Califor- 
nia, Berkeley. The study of vocal learning 
in bats is “nearly completely untapped”, he 
says — likening it to the state of research into 
birdsong 60 years ago. m 
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The sparrow 
with four sexes = 


Ecologist Elaina Tuttle spent her life trying to 
understand the bizarre chromosome evolution of 
acommon bird — until tragedy struck. 


BY CARRIE ARNOLD 


at Cranberry Lake — and there was one sound that he would 
never hear again. 

Every summer for more than 25 years, Gonser and his wife, Elaina 
Tuttle, had made the trip to this field station in the Adirondack Moun- 
tains — a 45-minute boat ride from the nearest road. Now, as he moored 
his boat to the shaky wooden dock, he heard a familiar and short song 
that sounded like ‘oh-sweet-Canada. The whistle was from a white- 
throated sparrow calling hopefully for a mate. 

What he didn’t hear was the voice or laughter of his wife. For the first 
time, Gonser was at Cranberry Lake alone. Just a few weeks earlier, 
Tuttle had died of breast cancer. 

Her entire career, and most of Gonser’s, had been devoted to under- 
standing every aspect of the biology of the white-throated sparrow 
(Zonotrichia albicollis). Less than six months before she died this year 
at the age of 52, the couple and their team published a paper’ that was the 
culmination of that work. It explained how a chance genetic mutation 
had put the species on an extraordinary evolutionary path. 

The mutation had flipped a large section of chromosome 2, leaving it 
unable to pair up with a partner and exchange genetic information. The 
more than 1,100 genes in the inversion were inherited together as part of 
a massive ‘supergene and eventually drove the evolution of two different 
‘morphs’ — subtypes of the bird that are coloured differently, behave dif- 
ferently and mate only with the opposite morph. Tuttle and Gonser’s leap 
was to show that this process is nearly identical to the early evolution of 
certain sex chromosomes, including the human X and Y. The research- 
ers realized that they were effectively watching the bird evolve two sex 
chromosomes, on top of the two it already had. 

“This bird acts like it has four sexes,’ says Christopher Balakrishnan, 
an evolutionary biologist at East Carolina University in Greenville, 
North Carolina, who worked with Tuttle and Gonser. “One individual 
can only mate with one-quarter of the population. There are very few 
sexual systems with more than two sexes.” 

The work helps to explain a long-standing puzzle for biologists. It 
shows how two identical chromosomes can evolve into distinct sub- 
types that can define the sexes of a species and their different behav- 
iours. “These birds are an amazing system,’ says Catherine Peichel, an 
evolutionary ecologist at the University of Berne. “The process of sex- 
chromosome evolution tends to erase much of the evidence of how it 
happened, so being able to watch the process in action is a huge benefit.” 

What makes Tuttle and Gonser’s project even more unusual is the 


Tee was one sound that biologist Rusty Gonser always heard 
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accumulation of almost 30 years of data, “something that is almost 
unheard of in biology now’, says Melissa Wilson Sayres, a computa- 
tional biologist at Arizona State University in Tempe. “Most people 
tend to jump from project to project.” 

Gonser is determined to carry on the project. He returned to the 
field station this summer to continue Tuttle’s legacy and use this 
drab little garden bird to understand how sex chromosomes may 
have evolved. “Who knows — there might be many more species 
that have weird sex chromosomes and we've just never bothered to 
look,” he says. 


A FLOCK OF IDEAS 

Gonser’s mind moves at a mile a minute, his thoughts racing ahead 
of his grey Honda Civic. Dressed in basketball shorts and a T-shirt, 
and sporting days-old stubble, he speeds west on the I-70 interstate 
towards Indiana State University in Terre Haute, where he and Tuttle 
shared an office. As ideas tumble out — about new research projects, 
people to meet, restaurants to avoid — he pauses briefly and apolo- 
gizes. “Sorry if I'm repeating myself. My brain hasn't been working 
right since ...” he pauses briefly, and clears his throat, “since Elaina 
passed” 

Tuttle and Gonser first met in 1991 at the State University of New 
York at Albany, where both were pursuing PhDs in ecology. Gonser 
was investigating the Puerto Rican frog Eleutherodactylus coqui and 
Tuttle was studying fish ecology at New York's Finger Lakes. Long 
months in the field suited her love for the natural world, and it was 
there that she got to know the curious lifestyle of the white-throated 
sparrow. 

The bird is relatively common in gardens across the eastern half of 
North America — and on first glance it is rather plain. All the birds 
have mostly tan and grey plumage, except for a patch of white under 
the chin and a bright pop of yellow between the eyes and beak. But 
closer scrutiny reveals the two types: some have white stripes on their 
heads, and some have tan stripes. And, as bird spotters and naturalists 
have long known, the two morphs behave in different ways. 

The tan-striped birds are poor at singing, monogamous and 
fiercely protect their hatchlings from predators such as raccoons and 
snakes. The white-striped ones are aggressive, promiscuous, more 
cavalier about their offspring, and tuneful: Gonser says that they 
produce a more operatic refrain of oh-sweet-Canada. White-striped 
birds seem to mate only with tan-striped ones — a relatively unusual 
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Opposites attract 


The white-throated sparrow, 
common in eastern North 
America, has either white or tan 
stripes on its head. White-stripes 
mate only with tan-stripes, and 
vice versa. Why these two 
morphs behave so differently 
was a long-standing puzzle. 
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What’s up with chromosome 2? 


An ancient x Inversion 
mutation inverted ) , 

a large section of 4 

chromosome 2, 

creating a The inverted region is unable to 
‘supergene’. exchange genetic information with 


KKK 


its partner chromosome, so 
mutations accumulate. 


The chromosomes diverged, driving different behaviours. 
This is thought to mirror the process by which mammalian 
(XY) and bird (WZ) sex chromosomes evolved. 


Like having four sexes 
Males always mate with females 
and 
white-stripes almost always mate with tan-stripes (disassortive mating). 
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On average, half the offspring inherit the inverted version of chromosome 2, 
producing half tan-striped and half white-striped offspring. 
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phenomenon called disassortative mating (see ‘Opposites attract’). 
Tuttle became interested; why do the two morphs behave in this way? 

The literature already contained a huge clue. In 1966, ornithologist 
H. B. Thorneycroft published a paper in Science’ pointing out the bird’s 
unusual chromosome pair. Tan birds carry two identical copies of chro- 
mosome 2, but in white birds, one copy contains an inversion. It’s as if 
someone had taken a pair of scissors, snipped out most of the chromo- 
some and placed the DNA back in reverse. A chromosomal inversion 
this large was rare in vertebrates, Thorneycraft noted. It looked like dis- 
assortative mating maintained the two morphs in equal proportions in 
the population, because Mendelian inheritance ensures that, on average, 
half the offspring of a white-tan pair will inherit the inverted version of 
chromosome 2. But it would take more work to prove that this was true. 

To Tuttle, it was a fascinating puzzle — a way to shed light on chro- 
mosome evolution, as well as the genes underlying social behaviour. 
In the early 1990s, however, it was too expensive and laborious to find 
answers by sequencing the bird’s genome. So Tuttle initially focused on 
collecting more detail about their behaviour, such as how they selected 
mates and where they built nests. The goal was to understand what 
might affect offspring survival. She caught and tagged birds, drew blood 
samples and perfected the art of collecting semen. “Elaina was the best 
bird masturbator I ever met,’ Gonser says. 

Gonser soon became drawn into the work. (The couple married 
in 1994.) After the birth of their son Caleb in 2000, the family began 
spending summers at Cranberry Lake, and they slowly learnt more 
about the sparrow. For a 2003 paper’, Tuttle used genetic analysis of 
nestlings to quantify the different reproductive strategies of the two 
morphs. She showed that nearly one-third of the offspring fathered by 
white-striped males were not born to females with whom they shared 
a nest. Tan males, by contrast, spent less of their energy finding extra 
partners and more time guarding their own, so they were more likely 
to have fathered the chicks they were raising. Although the two morphs 
went about reproduction in very different ways, both achieved equal 
reproductive success. 

Six years later, the team secured a grant from the US National Insti- 
tutes of Health to start genetic analyses. They mapped chromosome 2 
in detail* and found that the changes were not a single inversion as 
Thorneycroft had indicated, but actually a series of inversions within 
inversions that scrambled the order of genes. They identified several 
genes that seemed to be associated with feather colour and behaviour, 
and that might explain the differences between the two morphs. “It’s 
really, really rare to find such a direct relationship between a set of genes 
and behaviours. That’s what makes these birds so interesting to study,” 
says Donna Maney, a neuroendocrinologist at Emory University in 
Atlanta, Georgia, who uses the white-throated sparrow as a model to 
understand how hormones affect the brain. 

But then, in 2011, just as Tuttle and Gonser were ramping up their 
genome sequencing, a routine mammogram revealed a lump in one 
of Tuttle's breasts. A biopsy confirmed that it was invasive cancer. The 
couple was shocked — their son was only 11 — but a mastectomy and 
the drug tamoxifen seemed to contain the disease. Determined not to 
let the cancer control her life, Tuttle pressed on with her work. 


DOUBLY DIVIDED 
By this point, the researchers had come up with the concept that the 
birds were evolving a second set of sex chromosomes. “It’s a bizarre idea, 
but it just makes sense from the data,’ said Alan Bergland, a geneticist 
who studies the molecular basis of evolutionary adaptation at the Uni- 
versity of Virginia in Charlottesville. No other species was known to 
have one set of fully operational sex chromosomes and another pair that 
subdivided the species again on another aspect of mate choice. 
Evolution has led to many weird and wonderful sexes and means of 
determining them. Some animals, such as reptiles, have two sexes and 
no sex chromosomes, whereas the freshwater protozoan Tetrahymena 
thermophila has seven sexes, each of which can mate with any type 
except its own. Two sexes with one set of sex chromosomes is the most 
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Elaina Tuttle dedicated her scientific career to studies of the white-throated sparrow. 


common arrangement, and has evolved independently many times. But 
there’s no reason a species can't have more sex chromosomes, Wilson 
Sayres says. “If you've got two genes linked together that can’t cross 
over during meiosis, and one of them plays a role in sex determination, 
suddenly you can have a new sex chromosome.” 

Researchers believe that the X and Y chromosomes in many mam- 
mals, and the W and Z in birds, emerged from a major inversion on 
a normal pair of chromosomes that prevented them from swapping 
genetic material, or ‘crossing over’. The suite of genes in the inversion 
was cemented together as a supergene that was inherited in one large 
chunk. On both the Y and Z chromosomes, the inversion shifted the 
location of a gene that determined male or female sexual develop- 
ment, respectively (male birds are ZZ and females are ZW). Over time, 
the Y and Z chromosomes accumulated mutations, because crossing 
over wasn't weeding them out. But, because all 
this happened in the distant evolutionary past, 
scientists had struggled to identify the precise 
steps involved. 

The sparrows offered such a chance. The inver- 
sion on chromosome 2 doesn't include genes that 
determine sexual development — but it does 
contain some that affect the birds’ reproductive 
behaviour. Over time, those genes diverged and 
drove the different characteristics of the two morphs. “Whatever genes 
control these behavioural differences will ultimately be traced back to 
this inversion,’ says Maney. 

But to truly show that chromosome 2 was evolving like a sex chro- 
mosome, Tuttle and Gonser would need to demonstrate that the genes 
in the inversion were acquiring mutations much faster than those on 
other chromosomes. This would prove that the white and tan morphs 
were using true disassortative mating, so that all pairs were white-tan. 
Evena tiny percentage of white-white matings would allow the inverted 
chromosome 2 to undergo crossing over that would help to cleanse it of 
mutations, and undermine the idea that it was acting like a sex chromo- 
some. Tuttle and Gonser would need to sequence the genomes of many 
birds, to show parentage and to compare mutation rates in the inversion 
with the rest of the genome. 

Finding DNA to sequence was easy. By now, Tuttle had several freez- 
ers full of blood samples collected over the years from thousands of 
birds. But other biology was getting in the way. 

In autumn 2013, a chronic cough sent Tuttle back to the doc- 
tor with what she thought was bronchitis. She learned that her 


484 | NATURE | VOL 539 | 24 NOVEMBER 2016 


“These birds are 
an amazing 
system.” 


cancer had returned and spread to her lungs. 
Genome sequencing of her tumour revealed 
that her original cancer was a mosaic of 
cell types; the tamoxifen had tamed the 
hormone-sensitive cancer cells, but the insen- 
sitive ones had survived. She underwent more 
chemotherapy, which held the cancer in check. 


THE LAST LAKE TRIP 

By the summer of 2015, the team had gath- 
ered both the ecological and the genetic data 
that they needed and were putting the finish- 
ing touches to their big paper — when a rou- 
tine scan revealed that Tuttle’s tumours were 
growing again. Tuttle made the 13-hour drive 
from Terre Haute to Cranberry Lake between 
chemo treatments. “We knew she was sick, but 
we didnt realize just how sick she really was,’ 
says graduate student Lindsay Forrette. Soon, 
Tuttle was getting increasingly grim predictions 
about how much time she had left. “We were 
all thinking she was going to beat the cancer, 
and when we finally understood she wasnt...” 
Gonser's voice trails off. When Tuttle realized 
she would never return to Cranberry Lake, she broke down and cried. 

In January 2016, when the paper was published in Current Biology, 
it showed unequivocally that chromosome 2 was evolving like a sex 
chromosome. White-white and tan-tan matings were exceedingly rare. 
Using the whole-genome sequences of 50 birds, the team demonstrated 
that the genes in the inversion were acquiring mutations much more 
quickly than elsewhere in the genome, a pattern that echoed the evolu- 
tion of sex chromosomes in humans and birds. 

In apress release, Tuttle said: “This is probably my pinnacle paper.” 
It would also be her last. By spring, her health had worsened. Just five 
days before she died, Tuttle busied herself from her hospital bed, writing 
more papers and helping graduate students to analyse data. She died on 
15 June. When Gonser announced the news, he received condolences 
from around the world. “Everyone who knew her, loved her. She was 
just that kind of person,” Gonser says. 

Tuttle leaves a rich research legacy that raises 
further questions, including whether this chromo- 
somal system is ultimately destined to disappear. 
Balakrishnan says it is unsustainable. “That we 
never see systems with four sexes says that they're 
evolutionarily unstable and one of these alleles will 
ultimately go extinct.” That's because in a four-sex 
system, each bird must work that much harder to 
find a mate — a white female is not just looking for a male, she needs 
a tan one — so selection will favour a more advantageous two-sex sys- 
tem instead. Balakrishnan intends to use the white-throated sparrow to 
further tease apart the genetic and environmental factors that drove its 
evolution in the first place. 

Gonser will keep going back to Cranberry Lake. He and his group want 
to better understand which genes control the sparrow’s behaviour — from 
mate selection to parenting — and how those characteristics were affected 
by the inversion. They are using digital maps and satellite data to chart 
nesting sites, track tagged birds and build a richer set of behavioural data. 

“There’s a lot more information left in these birds,” Gonser says. “And 
I think Elaina would like that we're trying to uncover their secrets.” = 


Carrie Arnold is a writer based near Richmond, Virginia. 
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MITSUO OHTSUKI/SPL 


Scanning transmission electron microscopy has been used to reveal the hexagonal arrangement of the uranium atoms in a solution of uranyl acetate. 


Fire up the atom forge 


Rethink electron microscopy to build quantum materials from scratch, 
urge Sergei V. Kalinin, Albina Borisevich and Stephen Jesse. 


lectron microscopy is on the brink ofa 

E transition. Soon the imaging tool could 

be used to create structures atom by 

atom. This sort of control over atomic archi- 

tecture could transform our basic scientific 

understanding of materials and pave the way 

to new classes of devices for quantum com- 
puting, spin sensing and more. 

Currently, the only way to build at the 


atomic scale is with a scanning tunnel- 
ling microscope (STM). But this approach 
remains impractical and niche, 25 years after 
Don Eigler demonstrated it by spelling out 
the name of his company, IBM'. An STM 
can move only surface atoms and structures 
that are stable only at low temperatures. 
The technique is time-consuming and it 
requires bespoke machines. It can now make 


computing devices comprising several quan- 
tum bits (qubits), but not much more’. 

Electron microscopy is reliable and widely 
used to view thin sections of materials. It 
uses a beam of electrons to reveal the mate- 
rial’s crystal structure (see “The atomic forge’). 
However, the beam can nudge atoms out of 
alignment and alter the material’s form. This 
is a problem for imaging, but could be a 
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THE ATOMIC FORGE 


A SCANNING TRANSMISSION ELECTRON MICROSCOPE (STEM) fires a beam of electrons through a sample of 
material to pinpoint atoms and reveal the material’s crystal structure. For imaging, the downside is that the 
beam can move atoms slightly. But if these modifications can be controlled, it could be a boon. New materials 
could be built atom by atom by controlling the electron beam precisely. Such bespoke materials might enable 


PRECISE BEAM 


A beam delivers a prescribed 
dose of electrons toa 
particular site in a crystal for 
a certain length of time to 
move an atom as desired. 


ad 


new classes of devices for quantum computing, spin sensing and more. 


BEAM CONTROL 


Open-source software 
gives researchers full 
control over the precision, 
velocity and power of the 
. electron beam. 


FAST DETECTOR 


Information about 

the sample’s atomic 
structure is gathered in 
real time by detectors 
with many pixels. 


NEW MATERIALS 


By testing new atomic architectures, 
researchers will be able to design materials 
with specific properties and learn more about 
interactions between the beam and sample. 
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FEEDBACK 
SYSTEM 


A powerful computer 
analyses the detector 
data inreal time 

and decides how to 
control the beam to 
make the next atomic 
move. 


a 


ILLUSTRATION BY XVIVO SCIENTIFIC ANIMATION 


> boon for fabrication — as long as the mod- 
ifications can be controlled. 

For building at the atomic level, scanning 
transmission electron microscopy (STEM) 
seems to be the most promising form of 
the technique. An electron beam as fine as 
an atom is scanned through the specimen 
to probe its structure’. Several groups have 
used STEM to transform the compositions of 
layered materials in places*”, crystallize amor- 
phous oxides®, move atoms between spaces 
ina lattice’ and knock out single atoms from 
one-layer materials. 

Manipulating atoms with STEM will 
require three key developments, practi- 
cal and theoretical, which we outline here. 
Practically, we need better real-time moni- 
toring, feedback and beam control. These 
systems should use imaging methods* based 
on diffraction signals. They will need detec- 
tors capable of being read out fast and “big 
data’ computation methods capable of han- 
dling high volumes and rates of data while 
performing analytics in real time. Materials 
scientists must also refine theories of how 
electrons and matter interact. 


THREE WAYS FORWARD 

Control the beam. In standard STEM imag- 
ing, an electron probe scans back and forth 
across the specimen. Commercial imaging 
systems typically trace only a zig-zag pattern 
over a rectangular area, because arbitrary 
beam paths can distort the image. To deliver 
a prescribed number of electrons to a speci- 
fied location at a particular time demands 
full control over the position, velocity and 
power of the electron beam. Fine control is 
also useful where different parts of a sample 
react differently to the electrons or when only 
a portion of the field needs to be examined. 
Different scanning paths are useful, however. 
For example, spirals are faster to scan because 
they avoid abrupt changes in direction and 
deliver electrons more evenly”. The small, 
high-end manufacturer Nion in Kirland, 
Washington, has made its beam-control 
software open source so that researchers can 
develop models for different scanning strate- 
gies. Others should follow suit. 


Monitor and feed back in real time. Irregu- 
larities and defects in lattices blur STEM 
images. To build atom by atom, irregularities 
need to be spotted and located quickly so that 
the fabricator can decide whether to correct, 
adjust or leave them in place. Currently, the 
best way to do this is to keep rescanning. This 
is too slow. 

Researchers will need to be able to monitor 
and respond to detector signals in real time. 
Through a system of immediate feedback, the 
beam could be controlled to alter the rate and 
extent of modifications or reactions. Until 
that becomes available, regular rapid scans 
of small areas (such as 10-millisecond spiral 


scans of 1 nanometre square) could be used. 
Images would need to be analysed in real time 
and algorithms developed for adjusting the 
scanning trajectories. The speed and quanti- 
ties of data involved would require powerful 
computers and software. 

In the long term, diffraction patterns could 
be used to track lattice structures, as is cur- 
rently done in scanning-diffraction imaging. 
Collecting transmitted electrons on a pixel- 
lated detector instead of a single ring pre- 
serves more information. For example, the 
electron beam may scatter off new or missing 
atomic columns or be affected by symmetries 
and asymmetries in the surrounding crystal. 
Libraries of standard diffraction images can 
be consulted to show that a structure has been 
fabricated successfully. 


Improve theory and modelling. Knowing 
more about how the electron beam interacts 
with matter is crucial for predicting how the 
fabrication system will behave. For instance, 
researchers need to know how slight changes 
in the positions of atoms in a column affect 
scattering. A better grasp is needed of cou- 
pled nonlinear effects, such as how the beam 
changes a material and thus how the two 
interact subsequently. 

Greater understanding is needed of how 
electrons lose energy as they pass through 
the lattice (such as 


through collisions “Electron- 

with atoms or by‘ MlLcroscopy 
exciting acous- centres should 
tic modes). Such help the 
information will community share 
help to predict knowledge about 


which stronginter- observations.” 
actions will knock 

out single atoms or displace them in cascades. 
The formation of new, stable configurations 
should be explored. 

All this knowledge would allow fabricators 
to exploit subtleties such as the way a material 
responds to scan direction, path shape or the 
pulsing of the beam. The prize will be worth 
the effort: the researcher who can devise a 
scan path to steer an atom to a predefined 
location will be able to make materials for 
quantum computing. 


NEXT STEPS 

Realizing these opportunities will require 
interdisciplinary research. Those who work 
on electron microscopy, data analytics, image 
analysis, quantum computing, microfabrica- 
tion, theory, molecular dynamics, computer 
visualization, artificial intelligence and instru- 
ment engineering must come together. Such 
diverse teams are rare, and each community 
has a different language and motivation. 

To nucleate this effort, we propose that 
professional societies and think tanks host 
interdisciplinary workshops to discuss 
the feasibility and specific needs of atomic 


SUMMARY 

@ Materials could be made from 
scratch by enhancing a microscope 
that uses an electron beam to image 
the structure of crystals. 


@ Making the electron beam 
programmable — to vary how many 
electrons are fired, to where and for 
how long — would allow atoms to be 
moved, added or knocked out. 


@ The structure of the material needs 
to be monitored in real time so that 
progress can be followed and mistakes 
corrected. 


@ Researchers need to know more about 
how electron beams interact with atoms 
in crystal lattices, including how energy 
dissipates. 


fabrication using STEM. Technical advances 
in the spatial resolution of STEM are equally 
important, laterally and along the beam. 

Electron-microscopy centres should help 
the community share knowledge about obser- 
vations, atomic configurations and beam- 
induced phenomena by setting up open and 
searchable databases with analysis tools. 
Common data formats and cloud computing 
facilities are needed to support data analytics. 
Our community should look to exemplars in 
other fields, such as X-ray, mass spectrometry 
and crowdsourcing platforms in high-energy 
physics, astronomy and genomics. 

Let us make a start: the ability to build new 
forms of matter from the atom up will marka 
new chapter of nanoscience. = 
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research-and-development staff, and 
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Conventional silicon solar cells account for more than 90% of global production, yet making them uses energy equivalent to about 10% of their lifetime output. 


Print flexible solar cells 


Pay more attention to developing thin, mass-produced, affordable 
photovoltaic devices, urge Yi-Bing Cheng and colleagues. 


he future of solar energy depends 
Te a union of new and old technolo- 

gies. If photovoltaic (PV) devices 
that turn light into electricity could be mass 
produced with printing presses, as if they 
were newspapers or banknotes, they could 
be affordable and ubiquitous. 

Conventional, silicon-based, solar panels 
are rigid and bulky. Small, thin and flexible 
PV devices on films are already being made 
that are lightweight and translucent. These 
use little material and can generate electricity 
in low light, even indoors. Integrating them 
into phones and watches, as well as walls 
and windows, would transform the world’s 
energy generation, reduce pollution and 
mitigate climate change. 

Yet flexible solar panels face several 
hurdles. Some are based on harmful 
substances such as heavy metals, and their 
manufacture uses hazardous solvents. Oth- 
ers are quick to degrade and inefficient 
at converting light into electricity. Print- 
ers used in the publishing, computing and 


electronics industries struggle to print PV 
materials that need to be built with nano- 
metre precision over many square metres. 
For all these reasons, printable solar cells are 
yet to find a foothold in electricity markets. 

Most research and development invest- 
ment goes into conventional silicon solar 
cells, which account for more than 90% of 
global production. Yet manufacturing these 
uses a lot of energy: equivalent to about 10% 
of the cell’s lifetime output'. 

Printed solar cells won't become wide- 
spread until they are cheaper and safer to 
make. Researchers and businesses must 
work together to improve the efficiency, 
environmental impact and stability of these 
cells, scale up their manufacture and plan 
their market penetration. 


PHOTOVOLTAIC PRIMER 

Mass production at low cost is what the 
solar industry sorely needs. The power that 
a PV panel generates is proportional to the 
surface area exposed to sunlight. The world 
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consumes approximately 20,000 terawatt- 
hours of electricity each year*. Meeting this 
need would require enough PV devices to 
cover around 100,000 square kilometres, an 
area about the size of Iceland. 

Such production rates have yet to be 
achieved. Printed PV devices are typically 
made from many layers of material ona 
substrate of conductive glass or plastic. 
Each layer has a function: semiconduc- 
tors or sensitizers absorb visible light, and 
other materials carry electrical charges to 
electrodes. 

Many types of printed PV device are 
being developed. Some feature organic 
semiconductors such as polythiophenes. 
Others use light-absorbing dyes, includ- 
ing ruthenium-based polypyridines. 
And in quantum-dot solar cells, nano- 
particles absorb light. Other examples 
feature semiconductors with a chalcogen 
element (sulfur, selenium or tellurium) 
or contain organic-inorganic light absorbers 
with a structure similar to that of the mineral 


RUEBEN WU 


US NATL RENEWABLE ENERGY LAB 


CSIRO 


perovskite. All of these are classed as thin- 
film solar cells. 

The most efficient are the perovskite- 
based cells. The latest of these, with just a few 
years of research behind them (see ‘Catching 
up), convert 22% of incident solar power to 
electrical power’. This is more efficient 
than solar cells made from multicrystalline 
silicon. But perovskite cells cannot be rolled 
out commercially yet because they degrade 
under high humidity and heat. 


FINE DETAIL 

Printing layers that are nanometres to 
micrometres thick — uniformly and with- 
out pinholes, and over many square metres 
— is difficult. The electronics industry com- 
monly uses screen-printing, feeding a paste 
through a perforated screen*. But the layers 
in printed circuit boards are hundreds of 
times thicker than those of PV devices. And 
turning materials into viscous pastes alters 
their physical and electrical properties. 

Other techniques for printing PV devices 
have been demonstrated in the lab over 
areas of about 10 square centimetres. These 
include feeding ink through a slit (slot-die 
printing), spray-coating the substrate, pass- 
ing the substrate over a rotating cylinder 
(gravure printing) and moving a blade over 
the substrate through an ink supply. 

Each technique has a downside. In slot- 
die printing, for example, the spacing of 
patterns is hard to control, and gaps reduce 
the active area of a panel. And in gravure 
printing, contact between the printer head 
and the substrate can damage underlying lay- 
ers. Such drawbacks mean that printed solar 
cells are less than half as efficient as the best 
non-printed equivalents’. 

To print thin, pinhole-free layers over 
more than one square metre will require 
intelligent, more-precise equipment and 
laser processing. An alternative approach 
would be to develop PV materials that work 


Roll-to-roll printing of organic photovoltaic cells 


CATCHING UP 


Thin-film solar cells that can be processed as 
solutions and might be printed compete 
increasingly with silicon photovoltaics. 


== Single-crystal silicon == Multicrystal silicon 
= Perovskite 
== Quantum dot 


— Dye-sensitized == Organic 


Perovskite cells are 
as efficient as silicon 
but are unstable. 


Efficiency (%) 


1985 1995 2005 2015 
with existing industrial printing methods. 

Printing requires that materials can be 
formed into a liquid, solution or paste. For 
PV devices, this means using either solutions 
of chemicals (polymers, dyes or hybrid per- 
ovskite, for instance) or dispersions of nano- 
particles (such as quantum dots). But many 
of these can degrade over days to weeks if not 
properly sealed; and more-stable alternatives, 
such as silicon, are harder to print. 

A balance must also be struck between 
the efficiency of a device and the environ- 
mental impacts of its manufacture. The 
most efficient thin-film solar cells include 
toxic or rare materials, such as cadmium, 
ruthenium and lead, as well as hazardous 
organic solvents. Indium, another rare ele- 
ment, isa common ingredient in transparent 
conductive films for PV devices, and its use 
is expected to rise. 

Depleted mineral deposits and low rates of 
recycling threaten to exhaust indium stocks 
before the end of the century. Researchers 


are looking for ways to use abundant 
materials that can be processed in solution 
to make efficient devices with little toxic 
waste. For example, carbon-based elec- 
trodes and layers use less precious metal’. At 
present, such designs are often less efficient 
than others. 


WHAT NEXT? 

The huge success of silicon panels has 
become a hurdle for emerging technolo- 
gies. The manufacturers of silicon-based 
PV devices share materials, equipment and 
practices with sibling industries, such as 
computing. Developers of printable devices 
are isolated. And the maturity of the silicon 
industry means there is little urgency to 
develop alternatives. Capital investment and 
product commercialization are perceived 
as risky, given that printable PV devices are 
still in development. 

Government funding is needed to 
catapult such devices from a nascent to a 
competitive state, as happened in China 
with the silicon PV industry. Market 
penetration should be developed in stages. 
Early printable PV devices should target 
weaknesses in silicon-based technologies, 
such as their poor performance in low light 
and their lack of portability. The next wave 
should complement silicon solar cells and, 
ideally, be integrated with them. For exam- 
ple, silicon—perovskite devices would harvest 
a greater fraction of incoming sunlight than 
silicon devices alone could do. 

If printed technologies can capture 5% 
of the PV market, their advantages should 
ensure that they play an ever-increasing 
part in meeting growing demands for 
renewable energy. m 


Yi-Bing Cheng is professor of materials 
science and engineering at the State Key 

Lab of Advanced Technology for Materials 
Synthesis and Processing, Wuhan University 
of Technology, China, and at Monash 
University, Clayton, Victoria, Australia. 
Alex Pascoe is a research fellow at Monash 
University, Clayton, Victoria, Australia. 
Fuzhi Huang and Yong Peng are professors 
at the State Key Lab of Advanced Technology 
for Materials Synthesis and Processing, 
Wuhan University of Technology, China. 


1. Photovoltaics Report (Fraunhofer Institute for 
Solar Energy Systems, 2016); available at 
go.nature.com/2eusg7r 

2. Key World Energy Statistics (IEA, 2016); available 
at go.nature.com/2ewux1w 

3. Best Research-Cell Efficiencies (US National 
Renewable Energy Laboratory); available at 
go.nature.com/2exvaqO 

4. Das, R. Printing Equipment for Printed Electronics 
2015-2025: Market Opportunities for Printing, 
Curing and Integration Equipment (I|DTechEx, 
2015). 

5. Ye, M., Hong, X., Zhang, F. & Liu, X. J. Mater. 
Chem. A 4, 6755-6771 (2016). 

6. Mei, A. et al. Science 345, 295-298 (2014). 


24 NOVEMBER 2016 | VOL 539 | NATURE | 489 
© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Some of the Harvard Observatory ‘computers’ in 1925. Annie Jump Cannon is seated fifth from left; Cecilia Payne is at the drafting table. 


: 


Women who read the stars 


Sue Nelson delights in Dava Sobel’s account of a rare band of human computers. 


here are half a million photographic 
"Tose in the Harvard College Observa- 

tory collection, all unique. They date to 
the mid-1880s, and each can display the light 
from 50,000 stars. These fragments of the 
cosmos furthered our understanding of the 
Universe. They also reflect the dedication and 
intelligence of extraordinary women whose 
stories are more than astronomical history: 
they reveal lives of ambition, aspiration and 
brilliance. It takes a talented writer to inter- 
weave professional achievement with per- 
sonal insight. By the time I finished The Glass 
Universe, Dava Sobel’s wonderful, meticulous 
account, ithad moved me to tears. 

In the nineteenth century, US women 
had no vote (that would come in 1920). The 
careers open to the vast majority were in 
domestic service, farms, factories, schools 
or offices. Fortunately, Maria Mitchell, the 
first woman in the United States to discover 
a comet, left a guiding light. In 1865, she 
became the first professor of astronomy at 
Vassar College in Poughkeepsie, New York, a 
new institute of higher education for women. 

Yet change was slow. When Edward Picker- 
ing became director of the Harvard Observa- 
tory in Cambridge, Massachusetts, in 1877, 
being a ‘computer’ was still predominantly 
a man’s job. This low-paid but essential 
work involved noting a star’s brightness, or 


magnitude, on the 
photographic plates 
and computing its 
location in the sky. 
Pickering was espe- 
cially interested in 
variable stars, whose 
light would brighten 
and fade over a spe- 
cific period. These 
fluctuations, captured 
on the plates, required 


The Glass 
Universe: How 
the Ladies of 


constant observation, the Harvard 
but he couldn't afford Observatory Took 
the Measure of the 
extra staff. Stars 
Some years later, DAVA SOBEL 
opportunity arrivedin Viking: 2016 
the form of the wealthy 


Anna Palmer Draper, widow of amateur 
astronomer and physician Henry Draper. 
She set up a memorial to fund his great ambi- 
tion: to photograph the night sky and create 
a spectral catalogue of the stars. When star- 
light is split by a prism or spectroscope at the 
end ofa telescope, it can produce a tiny grey 
smudge, millimetres long, on a photographic 
plate. The separated light, on closer examina- 
tion, reveals vertical black spectral lines. This 
stellar barcode provides a star’s chemical com- 
position, colour and temperature. 

By 1883, six women were already working 


as computers at Harvard, a practice unique 
to the university. Within five years, the num- 
ber of paid female computers, from a range 
of backgrounds, had risen to 14. Their efforts 
would be boosted by philanthropist Cath- 
erine Wolfe Bruce, who in 1889 donated 
US$50,000 to the observatory, convinced 
that the introduction of photography and 
spectroscopy would advance the field. 

The Glass Universe concentrates on a few of 
the Harvard computers. Williamina Fleming, 
a Scottish school teacher, arrived at the obser- 
vatory in 1879, pregnant and abandoned by 
her husband. Pickering employed her as a 
maid and housekeeper before promoting 
her to computer in 1881. Fleming discovered 
more than 300 variable stars and, in 1899, 
became Harvard's first curator of astronomi- 
cal photographs. She and Pickering revised 
the classification of stars using their spectral 
lines, labelling them alphabetically from the 
bluest, hottest stars to the coolest red stars. 

Henrietta Swan Leavitt, who studied 
at another women’s institution, Radcliffe 
College in Cambridge, began as a volunteer 
in 1895, gaining a permanent post in 1903. 
Five years later, she determined the relation- 
ship between luminosity and period of fluc- 
tuation for a class of pulsing variable stars 
called Cepheids. That enabled astronomers to 
calculate a star’s distance from Earth, and 
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> in 1918 allowed Harlow Shapley, who 
would become Pickering’s successor, to 
extend the boundaries of the Milky Way. 
Leavitt's work also laid the foundation for 
Edwin Hubble’ finding, in 1925, that our 
Galaxy was nota lone ‘island Universe’ but 
one of billions. In 1929, Hubble used Leav- 
its work and spectral shifts to show that 
this populous Universe is also expanding. 
Annie Jump Cannon arrived in 1896, 
following studies in mathematics, physics 
and astronomy at another women's col- 
lege, Wellesley, in the Massachusetts town 
of the same name. She refined and simpli- 
fied the Pickering—Fleming system of stel- 
lar classification by proving that most stars 
were ofa similar type. The International 
Astronomical Union adopted her system 
in 1922, three years after Pickering’s death. 
Cecilia Payne, educated at the Uni- 
versity of Cambridge, UK, came to the 
observatory on a fellowship in 1923, 
under Shapley. As part of her PhD, she 
determined correctly that stars were pre- 
dominantly composed of hydrogen and 
helium. In 1956, she became Harvard’s 
first female professor of astronomy. 
Payne provides some of the book’s most 
touching passages. Sobel describes, for 
instance, Payne's distress at the untimely 
death of her friend and fellow computer 
Adelaide Ames in 1932, and her whirl- 
wind romance with the astronomer and 
Russian émigré Sergei Gaposchkin, whom 
she married in 1934. Later that year, Payne 
won the inaugural Annie Jump Cannon 
Prize for contributions to astronomy, 
receiving $50 and a gold pin, designed by 
Cannon, in the form ofa spiral nebula. 
These personal touches flesh out the 
women's lives. We are shown Payne, for 
instance, pawning her violin and jewel- 
lery to fund her research, and combining 
motherhood with astronomy. The low 
wages were a sticking point with many 
of them — Fleming often mentions it 
in her diaries. Yet, as Sobel shows, they 
clearly liked and admired Pickering and 
Shapley, who encouraged their work and 
facilitated their professional progress. 
One of the last computers in the 
twentieth century, Radcliffe alumna 
Ellen Dorrit Hoffleit, gets a brief mention 
towards the end. She joined the observa- 
tory in 1928. In 2004, I met her at Yale 
University in New Haven, Connecticut, 
when I was making a radio documentary 
for the BBC on the Harvard computers. 
An energetic 96, Hoffleit was working 
on a paper about meteors. Like all of 
this band of remarkable women, she was 
unforgettable, as is this book. m 


Sue Nelson is an award-winning science 
journalist and broadcaster. 
e-mail: sue@boffinmedia.co.uk 


492 | NATURE | VOL 539 | 24 NOVEMB 
© 20] 


C 


Life at the divide 


Alison Abbott hails a memoir from Italian senator and 
biologist Elena Cattaneo, scourge of pseudoscience. 


ne hot day in August 2013, the 
() phone rang in the laboratory of 

neuroscientist Elena Cattaneo at the 
University of Milan. The then-president of 
Italy, Giorgio Napolitano, wished to speak 
to her in Rome as soon as possible. 

Cattaneo was already a minor national 
figure, known for her opposition to all 
forms of pseudoscience, however powerful 
the perpetrators. In one of her first public 
fights, in 2009, she criticized the Vatican for 
condemning the use of human embryonic 
stem cells in research. At the time of the call, 
she was in the middle of a battle to stop the 
government supporting a clinical trial of 
an unproven stem-cell ‘therapy run by the 
maverick Stamina Foundation in Brescia. 
The call made her wonder whether she had 
overstepped some mark. 

Napolitano had something quite differ- 
ent to discuss at his residence in Rome, the 
baroque Quirinal Palace. He told Cattaneo, 
then aged just 51, that he wanted her to 
become a senator for life: what did she think? 

“It was as if my every atom had been 
paralysed by the question,” she recalls in her 
book Ogni giorno: Tra scienza e politica (Every 
Day: Between Science and Politics). “Seconds 
passed without me being able to line up a 
meaningful sentence. The President realised, 
and touching my arm as if to shake me, he 
jokingly said: ‘Professor, can I get you a cor- 
dial?” 

Cattaneo works on Huntington's disease, 
an incurable genetic disorder in which brain 


Elena Cattaneo. 
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Ogni giorno: Tra cells progressively die. 


scienza e politica Stem-cell research has 
(Every Day: provided new insights 
Between Science into the disease, which 
and Politics) is why she had fought 
ELENA CATTANEO 3 

Mondadori: 2016. so hard for the right 


to do it responsibly. 
Those battles stole precious time and energy. 
Would being a senator for life — with the 
inevitable widening of her repertoire of fights 
— drain too much from her scientific life? 

Ogni giorno is a fascinating account of how 
a scientist entered the messy business of poli- 
tics (arguably more so in Italy than in other 
rich countries) and learnt to survive. Cattaneo 
accepted the position. As one of only a few 
appointed by the president, the role signifies 
a person's importance to national culture. 
But she set boundaries. She would not give 
up her lab, and in the Senate she would engage 
mainly with themes relating to science. She 
would apply the same rigour to evidence 
supporting a political hypothesis as she 
would to that supporting a scientific one. She 
would not vote if such evidence were miss- 
ing, and she would vote only according to her 
conscience, never along any party lines. 

In August that year, she was sworn in along 
with conductor Claudio Abbado (who died 
five months later), architect Renzo Piano and 
Nobel-prizewinning physicist Carlo Rubbia. 
She threw herself into action, appointing a 
small team to help with research and admin- 
istration, and organized a series of workshops 
with invited scientists from around the world. 
The aim was to educate parliamentarians in 
themes such as science, health and innova- 
tion, neuroscience and cell therapy. 

She quickly learnt about Senate investiga- 
tions and organized one into the Stamina 
debacle, a process that took more than a 
year. Published in February 2015, the report 
assigns responsibility for every step in the 
affair that allowed public hospitals to host 
the treatment, the government to support 
it and numerous courts to rule that patients 
had the right to it — even though it had 
been condemned as dangerous by the Italian 
health authorities. 

Her repertoire of battles, as anticipated, did 
expand and she found herself on steep learn- 
ing curves. One of these was genetically mod- 
ified (GM) crops. In 2015, Italy’s parliament 
debated the sensitive issue of whether each 
European Union member state should have 
the right to ban cultivation in their territories 
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of GM crops that had been approved 
by EU authorities. Giving evidence at a 
Senate hearing, veterinary scientist Fed- 
erico Infascelli described experiments in 
which he had apparently showed harmful 
effects in animals that were fed GM crops. 
Cattaneo spent the weekend poring 
over his papers. She could not understand 
the data, and discussions with Infascelli 
did not help. So she prompted investiga- 
tions into his work, which eventually saw 
some of the papers retracted. Her team 
went on to produce a 1,556-page docu- 
ment for the Senate compiling nearly all 
available scientific evidence about GM 
crop safety. For her pains, Cattaneo was 
demonized by some in the Senate as a lob- 
byist for the agricultural biotechnology 

firm Monsanto. 
She has confronted many other issues 
in the Senate 
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some govern- 
ment decisions relating to science, the 
murky proposal for a major research 
centre to be built on the site of the 2015 
Expo in Milan, and issues such as research 
financing. 

Ogni giorno is a dispatch from a strong 
woman fighting in a hostile environment. 
She sees an anti-intellectual tendency in 
her parliament, an anti-scientific attitude 
in the Italian judiciary and a reluctance 
of most scientists in the country to speak 
up in scientific controversies — “leaving 
the floor open to pseudoscientists and 
charlatans”. 

Inevitably, she has made enemies. But 
she has made friends, too. In June, a total 
stranger, an accountant from the small 
provincial town of Molinella near Bolo- 
gna, left his entire fortune of more than 
€1.5 million (US$1.6 million) to her to 
distribute to research in ways she sees fit. 

Having reported these various stories 
for years, I was fascinated to learn from 
the book how many more weapons one 
has when fighting battles for science from 
within a political system — but just how 
taxing those battles can be. 

Cattaneo might turn out to be the last 
senator for life to be appointed. There 
will be a referendum on the Italian con- 
stitution on 4 December. A ‘yes’ vote will 
lead to a smaller, less powerful Senate in 
which the president will have the right to 
directly appoint a handful of senators for 
only a seven-year period. = 


Alison Abbott is Nature’ Senior 
European Correspondent. 
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A Glanville fritillary butterfly (Melitaea cinxia), subject of a unique long-term data-collection effort 


Winged insights 


H. Charles J. Godfray is inspired by the scientific 
memoir of late island ecologist kka Hanski. 


ombining a personal memoir with 
( serious discussion of a scientific 
subject is a difficult literary trick. The 
Finnish biologist Ilkka Hanski succeeded 
with aplomb in his last book, Messages 
from Islands, in which each chapter begins 
with insights from an island that moulded 
his thinking about ecology, evolution and 
conservation. Hanski, one of the foremost 
ecologists of his generation, died in May 
(A.-L. Laine Nature 534, 180; 2016). 
Finland is a land of lakes and islands, 
so perhaps it is not surprising that Finn- 
ish ecologists are drawn to investigating 
how populations and communities persist 
in fragmented habitats. Hanski is most 
celebrated for developing the ecological 
concept of a metapopulation — a popula- 
tion of populations connected by disper- 
sal — and its applications to conservation. 


There are several 
types, but a classical 
metapopulation is 
sometimes likened to 
a collection of “blink- 
ing lights”, with indi- 
vidual short-lived 
populations winking 
in and out of exist- 
ence while the whole 
ensemble persists. 
Hanski explored 


Messages from 
Islands: A Global 
Biodiversity Tour 


ILKKA HANSKI the concept through 
University of Chicago his 25- year, an d 
Press: 2016. ‘ 


ongoing, study of 
the Glanville fritil- 
lary (Melitaea cinxia) in the Aland Archi- 
pelago between Finland and Sweden. This 
checkerspot butterfly has exacting habitat 
requirements: it occupies a fluctuating > 
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> number of the small woodland meadows 
that constitute a habitat archipelago within 
the geographical archipelago. The meadows 
are so tiny that they support only a small 
butterfly population; each has a high risk of 
extinction every year. 

Hanski, his colleagues and an ever- 
changing army of students surveyed all 
4,000 or so meadows, which support 
400-800 populations each year. Through 
this and many experiments, such as quan- 
tifying rates of dispersal between patches, 
they constructed a model of the butterfly’s 
metapopulation — the most detailed and 
satisfying description of such a popula- 
tion structure currently available, by some 
distance. 

Aland, of course, is the basis of a 
chapter. We learn how Hanski drew on 
the deep knowledge of Finnish butterflies 
and moths that he gained as a keen teen- 
age naturalist in his search for a suitable 
model system. As elsewhere in the book, 
he explores much broader questions in 
population biology — for example, how 
agriculture, deforestion and other types of 
habitat change have shattered what would 
once have been contiguous habitats. Many 
more species live in metapopulations today 
than in the past, so understanding them 
can help in the design of protected areas, 
increasing the chance that a threatened 
species will persist. 

Hanski’s passionate concern for 
biodiversity is evident throughout the 
book. The chapter on Madagascar dis- 
cusses the island’s community of nearly 
300 species of dung beetle (the subject of 
Hanski’s doctorate and an abiding passion 
throughout his life). Madagascar has expe- 
rienced extensive deforestation in the past 
century, making it likely that some of the 
dung-beetle species known from nine- 
teenth-century collections are now extinct. 
More positively, Hanski and his students 
have documented how some species have 
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Biologist IIkka Hanski, who died this year. 
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A few of the 300 dung-beetle species endemic to Madagascar. 


evolved to utilize the dung of introduced 
livestock such as cattle. 

This leads to a more general discussion 
of how biodiversity is generated, for exam- 
ple drawing lessons from Darwin's finches 
and pollinating bees about when natural 
selection favours specialization or being a 
jack-of-all-trades. That, in turn, leads back 
to the Glanville fritillary, in which Hanski 
has demonstrated genetic differences in the 
colonization ability of butterflies from the 
core and periphery of the metapopulation. 

Haminanluoto is the smallest island dis- 
cussed in the book. This 2-hectare islet is on 
the south coast of Finland near the village of 
Hanski, named after a seventeenth-century 
ancestor of the ecologist. At the age of 14, 
Hanski counted 15 species of bird on the 
island. Repeat counts in 2003 and 2013 
found about the same number of species, 
although some of the original species had 
gone and new ones had colonized the island. 
Hanski muses on how human activities 
have affected the natural dynamics of spe- 
cies turnover. This leads to thinking about 

the future potential 
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As a doctoral student he took part in an 
expedition to the virgin forests of Gunung 
Mulu in Sarawak, on the north coast of 
Borneo. He studied the dung-beetle com- 
munities living at different altitudes and 
found that species composition changed as 
you moved up the mountain. Thirty-five 
years later, he returned and repeated the 
exercise. He found the same sequence of 
species along the transect, but the whole 
distribution had shifted upwards, prob- 
ably owing to global warming. Hanski 
laments another difference: Gunung Mulu 
is itself now an island in a sea of oil palm, 
no longer part of a much more extensive 
forest. 

Hanski wrote Messages from Islands after 
being diagnosed with cancer. His book does 
not shy away from the challenges that face 
humanity or the threats to biodiversity. Yet 
it is an uplifting read, revealing the author's 
humanity and deep love of the natural world. 
I cannot think of a better book to give to a 
general reader who wants an insight into 
modern ecology and how ecologists go 
about their trade. m 


H. Charles J. Godfray is in the 
Department of Zoology at the University of 
Oxford, UK. 

e-mail: charles.godfray@zoo.ox.ac.uk 
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Trump: China could 
take lead on climate 


Donald Trump has questioned 
the science behind climate change 
and committed the United States 
to withdraw from the Paris 
agreement, raising concerns 

that the US contribution to 
international climate funds will 
cease (see also Nature http:// 
doi.org/bs57; 2016). This could 
cause some of the least-developed 
countries to lose interest in the 
Paris process and make many 

of its essential elements much 
harder to implement. 

The Paris agreement is 
supposed to start its first review in 
2018 — a process that is already 
fraught with political difficulties 
that call for strong leadership 
from the United States, Europe 
and China. If the United States 
pulls out and Europe is distracted 
by problems such as Britain’s 
upcoming exit from the European 
Union, it could cement China's 
position as a leader and shaper of 
global order. 

David Victor University of 
California, San Diego, 
California, USA. 
david.victor@ucsd.edu 


Trump: keep climate 
plans to boost jobs 


The economic insecurity of US 
voters contributed to the election 
of Donald Trump as president. 
Alarmingly, he has pledged to 
support coal usage, withdraw the 
US commitment to the 2015 Paris 
climate agreement and reverse the 
Clean Power Plan, which aims to 
regulate carbon dioxide emissions 
from power plants. Although 

the scientific community cannot 
back away from climate change, 
researchers could pay more 
attention to the economic needs 
of voters and develop strategies 
that manage climate risks while 
bolstering employment and the 
US economy. 

For example, cheap domestic 
natural gas delivered by fracking 
has created economic benefit and 
made the United States an energy 


exporter. As long as operators 
control fugitive emissions, gas 
from fracking will drastically 
reduce US carbon emissions. 

If appropriate environmental 
and social control were to be 
combined with carbon capture 
and storage — a technology best 
implemented by repurposed oil 
companies — cheap gas could 
provide both an economic anda 
climate solution. 

The few countries that have 
already decarbonized their 
electricity systems have done so 
by relying on nuclear power. A 
revitalized US nuclear industry 
could deliver large amounts of 
safer and cheaper carbon-free 
nuclear power — and provide 
jobs in design, construction and 
maintenance. 

A focus on emissions and 
technology coupled to economic 
development, not to ideology 
and causation, might be more 
successful with future voters. 
Jane C. S. Long Oakland, 
California, USA. 
janecslong@gmail.com 


Trump: unlikely to 
deter clean energy 


It is unlikely that the US 

election result alone can derail 
climate-change mitigation and 
clean-energy solutions (see also 
Nature 539, 337-339; 2016). 
The global momentum in policy 
and technology is in large part 
beyond the reach of the next 

US administration. 

Policies to support emissions 
reduction and clean-energy 
expansion are already in place 
around the world. As wellas their 
impact on climate change, these 
cut pollution, create jobs and 
boost national competitiveness. 
These are strong reasons against 
policy reversal in China and 
elsewhere. 

Some US federal regulations, 
such as fuel-economy standards, 
are not easily overturned. In 
the electricity sector, several US 
states are well positioned to make 
up for changes to federal policies 
that could otherwise limit 


climate progress, at least over the 
next 4-8 years. 

This policy momentum is 
partly due to the success of past 
policies in driving down the 
cost of renewables, through 
technological innovation and 
economies of scale in private 
firms. This has encouraged 
further growth in markets and 
in worldwide commitments 
to support decarbonization, 
which can continue despite new 
uncertainties about US federal 
policy. 

Jessika Trancik Massachusetts 
Institute of Technology, 
Cambridge, Massachusetts, USA. 
trancik@mit.edu 


Trump: threats to 
space science? 


President-elect Donald Trump 
has declared his support for 
NASAs exploration mission 

and its human space-flight 
programme in particular. As a 
professor of planetary and space 
sciences, Iam concerned that his 
opinion may be coloured by the 
flag-waving opportunities that 
these present, and that he could 
be interested in territorial rather 
than global benefit (see also 
Nature http://doi.org/bs58; 2016). 

Iam also concerned about 
statements he has made 
regarding public-private 
partnerships, and increasing the 
role of the private sector in space 
exploration. As a businessman, 
he may decide to sell off some 
of NASA’ assets to the private 
sector, without sufficient thought 
for safety and governance. 

In my view, the international 
collaborative efforts that are 
currently in place are the correct 
way to progress. More-stringent 
visa regulations could make it 
harder for UK researchers to 
take up fellowships and research 
positions in the United States, 
hampering our partnerships in 
space missions and restricting the 
sharing of data and technology. 
Monica Grady Open University, 
Milton Keynes, UK. 
monica.grady@open.ac.uk 
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Illness should not 
curtail PhD funding 


As PhD students who were 
diagnosed with cancer during 
our training, we have first-hand 
experience of the financial 
consequences of taking time 
out for treatment and recovery. 
Cutting a student's stipend 
because of serious illness is 
standard practice in the United 
Kingdom, compounding the 
combined stresses of delays to 
the research project and of the 
condition itself. 

In 3-4 years, UK doctoral 
students are expected to master 
new techniques and to generate a 
solid body of results for research 
papers and a thesis. Their 
university and funding contracts 
reflect this ‘apprentice’ status and 
come with restricted rights. 

Training grants from the UK 
research councils typically allow 
13 weeks of paid sick leave; the 
studentship is suspended if an 
illness is expected to last longer 
(see www.rcuk.ac.uk/funding/ 
grantstcs). Other funders may not 
cover sick leave at all, and most 
universities do not step in to help. 

Asa result, a student who 
is ill can be forced to continue 
working, to rely on contributions 
from friends and relatives, or to 
quit their PhD — all unacceptable 
‘punishments’ in our view. We 
suggest that training-grant 
contracts should be more similar 
to academic staff contracts in 
providing at least six months of 
paid sick leave. 

Edwin S. Dalmaijer University 
of Oxford, UK. 

E. A. Claudia Pama University of 
Cambridge, UK. 

Stella Prins University College 
London, UK. 
edwin.dalmaijer@psy.ox.ac.uk 


CORRECTION 

Lihua Yang’s Correspondence 
(Nature 539, 168; 2016) 

should have referred to possible 
rewards to scientists publishing 
in ‘top’ international journals, 
not just in Chinese journals. 
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BRIEF COMMUNICATIONS ARISING 


Ephrin Bs and canonical Reelin signalling 


ARISING FROM A. Sentirk, S. Pfennig, A. Weiss, K. Burk, & A. Acker-Palmer Nature 472, 356-360 (2011); doi:10.1038/nature09874 


During cortical development, cell-cell interactions between ephrin 
receptors and ephrin B ligands are important for neuronal migration, 
axonal and dendritic growth, synaptogenesis, and pruning’. Deficiency 
in Reelin (Reln) disrupts radial pyramidal neuron migration and results 
in cortical layer malformation’. Recently, Sentiirk et al. claimed that 
EphB receptor-ephrin B ligand (EphB-Efnb) and Reln pathways inter- 
act genetically and that layering defects in triple Efnb1;2;3-/~ mutants 
and Efnb3~'~;Reln*/~ compound mice are similar to those present in 
reeler mice”. Here we demonstrate that, contrary to this previous report, 
neither the compound mutant Efnb3~/~;Reln*’/~ nor the triple-mutant 
Efnb1;2;3-'~ show defects of cortical layering or elevated Dab] levels. 
Although protein-protein interactions between Efnb ligands and 
Reln have been confirmed, they do not affect the radial positioning 
of migrating neocortical pyramidal neurons. There is a Reply to this 
Comment by Sentiirk, A. et al. Nature 539, http://dx.doi.org/10.1038/ 
nature/20130 (2016). 

The similarity of the reeler-like phenotype in mice that lack the Reln 
receptors Vldlr and Apoer2, the intracellular kinases Src and Fyn, or 
the adaptor protein Dab] led to the identification of the canonical Reln 
pathway: the binding of Reln to its receptors induces Dab1 phospho- 
rylation (pDab1) via Src-family kinases, regulating actin and tubulin 
dynamics of the cytoskeleton necessary for orderly radial migration of 
cortical neurons. Degradation of pDab1 occurs through ubiquitination, 
so levels of unphosphorylated Dab1in reeler mice are elevated. Dab1 
binds to the intracellular NPXY motif of Vidlr and Apoer2 via its phos- 
photyrosine binding domain, yet it is unclear how Src-family kinases 
are recruited to phosphorylate Dab1. Sentiirk et al. suggested that this 
role was filled by ephrins”. Ephrins are membrane-anchored proteins 
that control axon guidance and other forms of cell-cell communication, 
including at the synapse. Cellular communication is bidirectional 
through kinase activation in the Eph-expressing cell and effector 
recruitment on the Efn-expressing cell®. Sentiirk et al. reported that 
the ectodomains of Efnb2 and Efnb3 bind to Reln’. We confirmed that 
Efnb3 binds to Reln, whereas Efnb1 does not*. In addition, genetic 
interaction of Efnb genes with Reln was implicated through the analysis 
of compound Efnb3~' —:Relnt!— animals, which Sentiirk et al. reported 
to exhibit a similar phenotype to the reeler mouse”. On the basis of their 
results they concluded that recruitment of Src or Fyn requires Efnb 
ligands, making Efnb1-3 essential for canonical Reln signalling and 
neuronal migration during brain development”. We have been unable 
to reproduce key elements of the mechanism reported by Sentiirk et al.” 

We previously reported that Reln also directly binds to EphB 
receptors‘. Furthermore, Reln induces EphB2 clustering, tyrosine phos- 
phorylation and degradation, though this is less efficient than by Efnb1. 
This signalling mechanism was independent of the canonical Reln 
receptors Apoer2 and Vldlr or of Dab1 phosphorylation‘. To determine 
whether EphB receptors play a similar role in the radial migration of 
cortical pyramidal neurons as described for Efnb ligands (particularly 
for Efnb3~’ ~;Reln*'~)?, we examined the brains of Ephb2"' ~sRelnt!— 
and Efnb3~'~;Reln *'~ mice (Efnb3~'~ mice have been described in 
detail before*). In contrast to Sentiirk et al.2, we were unable to detect 
any malformations of cortical pyramidal neuron layering in either 
of these genotypes. Nissl staining (Fig. la-d), NeuN (Fig. 2a), Brn1 

(Fig. 1i-l, q-v), and Tbr1 (Fig. lm—p) immunoreactivity and analysis 
of the distribution and polarity of Thyl-GFP-M-positive neurons 
(Fig. le-h and Supplementary Fig. 1) failed to reveal any obvious cor- 
tical layer malformation. In addition we did not find the described 
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difference in the number of NeuN-positive neurons in the corti- 
cal marginal zone (Fig. 2a, c), the hippocampal CA1 stratum oriens 
(Fig. 2b, c) or the cerebellar molecular layer” (Fig. 2b, c) when com- 
paring Efnb3~'~;Reln*'~ mice to their Efnb3~'~;Reln*’* littermates. 

During development, Cajal—Retzius cells secrete Reln and express 
a set of different EphB and Efnb proteins on their surface. Early in 
cortical development, Cajal—Retzius cells in Ephb1;2;3 ~~ mutant mice 
disperse less efficiently, resulting in large gaps between neighbouring 
Cajal—Retzius cells in the marginal zone, most prominently in the 
caudal region of the hemispheres®. We therefore analysed the layering 
of the caudal cortex of Efnb3~/ —;Reln*!— mice, but could not find 
any differences in this anatomical locale either (Fig. li, k, m, 0, q-s). 
However, we recorded a non-Mendelian inheritance pattern in 
Efnb3~'~;Reln*!~ x Efnb3~'~;Reln*'~ and Ephb2~'~;Reln*'~ x 
Ephb2~'~;Reln*!~ genotype crosses at weaning age, with only 8% 
(12 of 150, P< 1.5 x 10~° by y? test) or 16% (27 of 174, P< 0.005, 
x? test) of double-knockout mice surviving, respectively. The survival 
of mice homozygous for the reeler mutation in the Reln compound 
Efnb3~'~ line was significantly lower than in the Ephb2~'~ line 
(P <0.05, two-tailed Fisher’s exact test). 

We next attempted to reproduce the reported reeler-like phenotype 
of the Efnb1;2;37/ ~ triple knockouts”. To generate Efnb1;2;3~/ ~ triple 
knockouts we used a Nestin-cre transgenic line, as was used by Sentiirk 
et al.”, and combined it with Efnb1'*”” and Efnb2"**"** conditional alleles 
and an Efnb3 null allele. In the developing central nervous system, 
the Nestin promoter is active from embryonic day (E)7.75 onwards’ 
and Nestin—cre-driven recombination can be detected as early as E11 
(ref. 8). Our analysis of Efnb1 32;3-/- triple knockouts did not reveal 
abnormal cortical pyramidal neuron migration (Figs 1s, v, 2a) or 
canonical Apoer2/Vldlr-dependent Reln pathway disruption (Fig. 2d, e). 

Analysis of Ephb1;2;3 ate triple knockouts for the three predominant 
EphB receptors expressed in the brain also failed to reveal cortical layer- 
ing malformations (Figs 1s, v, 2a) or canonical Reln pathway disruption 
(Fig. 2d, e). 

During the phase of pyramidal neuron migration (E11-E17) Reln is 
produced by Cajal—Retzius cells at the pial surface, whereas Apoer2 and 
Vldlr are expressed by migrating pyramidal neurons. Temporal expres- 
sion of Ephb and Efnb mRNAs in the neocortex, as revealed by the Allen 
Developing Mouse Brain atlas’, showed expression of Efnb1, Efnb2, 
Ephb1 and Ephb2 from E11.5-E18.5. Efnb3 and Ephb3 expression was 
extremely low in the cortex. By contrast, almost all Ephb and Efnb genes 
were highly expressed in the developing hippocampus, where they are 
essential for proper layering and axonal and dendritic outhgrowth*!0"". 

The Efnb3~'~ mutant mouse strains used by Sentiirk et al.” and 
our study were generated by different knockout strategies. Sentiirk 
et al.” used the Efnb3-/~ mutant described previously”, in which a 
lacZ knock-in at the initiating methionine codon in the first exon 
of Efnb3 generates a true null allele. By contrast, the Efnb3 mutation 
used in this study involves the insertion of a neo cassette into Efnb3 
intron 4 (Efnb3"""?)°, which causes the loss of any detectable 
protein expression}. Phenotypic expression of the gene defect in both 
strains is indistinguishable: examples include a unique locomotor 
defect, cortico-spinal projection defect and impaired long-term 
potentiation?!?1#15, Taken together, the data are consistent with 
the Efnb3 mutant developed previously” being a null allele, rather 
than merely hypomorphic. There is ample evidence that differences 
in genetic background can contribute to the severity of mutant 
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Figure 1 | Efnb ligands are not essential for Reln-guided radial pyramidal 
neuron migration. a—d, Nissl staining in the somatosensory cortex of adult 
mice with the indicated genetic mutations. e-~h, Thy1-GFP-labelled layer V 
neurons of adult mice (double-headed arrows). In all genotypes a few ectopic 
cells were found in the retrosplenial area (arrows). Previously described 
prolonged fibres in the hippocampal infrapyramidal bundle (arrowheads)’”, 
dispersion of CA3 pyramidal cells (asterisk), reduced volume of the 

granule layer (dagger)*"! and agenesis of the corpus callosum (ACC, found 
occasionally) are indicated. i-p, Brn1 (i-I) and Tbr1 immunoreactivity 
(m-p) of P3 cortices. q-v, Brnl and NeuN in the adult cortex. CC, corpus 
callosum. Images are caudal (i, k, m, 0, q-s) or rostral (j, 1, n, p, t-v) of the 
somatosensory cortex. 


phenotypes. However, the effects of Reln knockout on neuronal 
layering and the effects of Efnb3 knockout on locomotion and mossy 
fibre pruning are very robust in a variety of genetic backgrounds. 
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Figure 2 | Efnb3~'—;Reln*'~ mice lack typical features of reeler mice such 
as over-migration of pyramidal neurons and elevated Dab] levels. 

a, b, NeuN in the somatosensory cortex (a), cerebellum and hippocampus (b). 
Distinct layers I and V (double-headed arrows) are apparent, except for 

in Reln~'~ mice. c, Quantification of NeuN labelled neurons in different 
layers as indicated. Source data for the graphical representations are 
provided in the Supplementary Information. d, e, Dab1 levels (whole 

brain) did not differ significantly between Efnb3~'~ and Efnb3~'~;Reln*!~ 
or Ephb2~'~ and Ephb2~'~;Reln*!~ mice. In Efnb1;2;3~'~ (*P < 0.05), 
Ephb1;3-'~ (**P < 0.005), or Ephb1;2;3-'~ (*P < 0.05) mice, Dab1 

was slightly decreased, and increased threefold in Ephb2~'~;Reln-/~ 

(***P < 0.0005) mice; NS, not significant. Two-tailed Student's t-test. For 
western blot source data, see Supplementary Fig. 2. CA1, CA3, hippocampal 
areas; DG, dentate gyrus; GCL, dentate granular; ML molecular layer; SO, 
strata orions; WT, wild type. 


Although it is possible that Sentiirk et al.? saw phenotypic variations 
in their Efnb3 —'-.RelInt!- and Efnb1;2;3 ~—!~mice that depend on genetic 
background, this would merely reaffirm that Efnb ligands are in fact 
not essential for Reln pathway activation. 

Our findings show that Efnb ligands and EphB receptors are not 
required for Reln-guided primary radial migration in the neocor- 
tex, whereas possible roles in pyramidal neuron axonal or dendritic 
growth, pruning, synaptogenesis and/or synaptic plasticity remain to 
be determined. 


The source of mutant mouse lines, their breeding, as well as immu- 
nohistochemical and biochemical techniques, are described in the 
Supplementary Information. 
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REPLYING TO T. Pohlkamp et al. Nature 539, http://dx.doi.org/10.1038/nature20129 (2016) 


In 2011, our laboratory reported that cortical pyramidal neurons 
fail to migrate properly in Efnb1;2;3~’~ triple knockouts and in 
Efnb3~‘~;Reln*’~ compound mice!. We provided genetic and bio- 
chemical evidence for an interaction between ephrin B (Efnb) ligands 
and Reelin (Reln) signalling during the development of laminated 
structures in the brain, such as the cortex, hippocampus and cerebellum. 
In the accompanying Comment’, Pohlkamp et al. now revisit this 
question; however, they use a different Efnb3 knockout strategy. 
Although the authors” confirm our observation! that Efnb ligands 
and Reln interact genetically in vivo, as indicated by the increased 
in utero mortality of double knockout Efnb3~’~;Reln ’~ embryos, they 
report their inability to reproduce a subset of the migration phenotypes 
originally reported by us. One likely explanation for this apparent dis- 
crepancy is the difference in the Efnb3 knockout targeting strategy used 
in the two studies’. The Efnb3 allele used by us has a deletion in a large 
part of the first exon? and was confirmed null by Southern blot analysis 
(see figure 1 of ref. 3). Pohlkamp et al.? used the mutant Efnb3”°?"” in 
their analysis, which has been previously described by these authors to 
be hypomorphic (that is, a partial loss-of-function allele)*. Therefore, 
the possibility that residual Efnb3 activity could rescue the specific 
subset of cortical migration phenotypes they fail to reproduce in their 
model cannot be excluded. Previous data stating that the Efnb3"°/"° 
does not reproduce the long-term potentiation defects observed in our 
complete loss-of-function allele for Efnb3 strongly indicates that this is 
the case*®. Moreover, the two alleles have also been reported to differ 
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in their effects on synapse formation’. Unfortunately, Pohlkamp et al.” 
cite neither this evidence nor their own previous work>”. 

In addition to the possible effects caused by differing allele 
strengths, any differences in the penetrance of the phenotypes may 
also be partially attributed to differences in the genetic background of 
the mouse strains used. In this case, different sensitivity to Reln gene 
dosage in Efnb3~/~;Reln*!~ compounds might be compensated by 
Efnb-independent mechanisms, as already proposed in our original 
manuscript’. 

Most notably, the complete absence or low quality of analysis 
performed by Pohlkamp et al.” (for example, they perform no 
quantification, show single examples of low-magnification images and 
use poorly stained samples with high background noise) should pre- 
vent any meaningful conclusions from being drawn and does not sup- 
port the strong statements made by the authors. As a further example 
of this, Pohlkamp et al.* disregard some of the phenotypes that are 
apparent in their own representative images—they do not refer to the 
ectopic placement of neurons in the upper layers of the cortex in the 
Efnb3~';Reln'~ compound mice (as can be seen when comparing 
figure la and 1b of their Comment?) or to the lack of projections 
from CA1 pyramidal neurons to the stratum lacunosum-moleculare 
(see, among others, figure 1h of their Comment’). 

It should be noted that, since publication of our paper’, crosstalk 
between Reln and the EphB-Efnb system has been confirmed both 
biochemically’ and in vivo’, notably also by the authors of the 
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Contesting the massacre at Nataruk 


ARISING FROM M. Miraz6n Lahr et al. Nature 529, 394-398 (2016); doi:10.1038/nature16477 


Mirazon Lahr et al.' present the earliest evidence of inter-group warfare 
at the East African site of Nataruk. Their evidence of warfare is based 
on three inferences: that the skeletons were all contemporaneous, that 
the bodies were left unburied, and that most individuals exhibited 
perimortem trauma consistent with interpersonal violence. We believe 
the data suggest that the burials are not contemporaneous and that 
most of the observed cranial damage is inconsistent with blunt force 
trauma. Therefore, the inference of inter-group warfare is premature. 
There is a Reply to this Comment by Mirazon Lahr, M. et al. Nature 
539, http://dx.doi.org/10.1038/nature19779 (2016). 

Miraz6n Lahr et al.' suggest that the skeletons at Nataruk document 
a single-event massacre of a forager band, after which the bodies 
remained unburied near the edge of a lagoon. The case for warfare 
is based on observations of alleged perimortem trauma to infer 
multiple simultaneous fatalities. Although the embedded obsidian 
lithic in KNM-WT 71251 is suggestive, interpretations of perimortem 
blunt force trauma on most crania are inconsistent with the forensic 
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anthropology literature*®. We discuss three of the more compelling 
examples here. 

Miraz6n Lahr et al.' describe a radiating fracture on the left frontal 
of KNM-WT 71253 as produced by an inferred strike to the left 
parieto-temporal region that was unobservable owing to erosion of 
the posterior vault. A similar fracture is noted on the right frontal. Both 
fractures are transversely oriented midway between the coronal suture 
and the supra-orbital margins and, importantly, are unconnected. 
Given the lack of an impact site, the right frontal fracture must be 
taphonomic, providing strong evidence for the presence of conditions 
conducive to post-depositional warping and cracking. This interpreta- 
tion is supported by the directional change exhibited by this fracture—a 
feature shared, to a lesser extent, by the fracture on the left frontal 
described as resulting from blunt force. Such changes in direction 
have been associated experimentally with postmortem weathering 
processes’ (Fig. 1a, b) and fractures with this morphology are not inter- 
preted as evidence for trauma in the bioarchaeological literature*'. 


Figure 1 | Examples of cranial vault 
taphonomy and trauma from modern 
forensics cases. a, b, Radiating fractures 
resulting from exposure and weathering, note 
the similar appearance of 1A to KNM-WT 
71257, and the similar appearance of 1B to 
KNM-WT 71251; c-e, and inferior portion of 
g, vertically oriented fracture lines resulting 
from blunt force trauma to the frontal bone; 

f, radiating and concentric fractures resulting 
from blunt force trauma to the right temporo- 
parietal with convergence of radiating fractures 
at impact point; g, wedge-shaped fragments 
resulting from repeated blows to the right 
parietal, note chipping of fracture edges, 
separation of internal and external tables, and 
absence of fractures oriented in a grid-like 
pattern. 
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Given these observations and the erosion of the posterior cranial 
vault, we believe that the left frontal fracture is also taphonomic 
in origin. 

KNM-WT 71264 shows a large horizontal fracture on the frontal 
bone interpreted as evidence of blunt force trauma. However, the 
fracture pattern is inconsistent with the expectations of a direct impact 
to this region and the mechanics of fracture propagation. The mor- 
phology of the fracture is irregular, exhibiting multiple abrupt changes 
in direction with a transverse orientation that crosses the midline. 
Numerous studies indicate that blunt force trauma to the frontal squama 
results in vertically-oriented fractures due to the midfrontal buttressing 
of the cranial vault?*”? (Fig. 1c-e, g). Fractures approaching such 
buttresses obliquely, as in the images provided by Miraz6n Lahr et al.', 
typically divert towards areas that are less structurally reinforced’. This 
is inconsistent with the pattern observed in KNM-WT 71264. When 
fractures of the frontal bone are oriented transversely, they tend to be 
located more inferiorly and traverse the area of weakness associated 
with the frontal sinus, which is conspicuously intact in the images 
provided by Mirazén Lahr et al.'. In a large sample of experimental 
(n=195) and clinical (n =290) cases of frontobasal fractures, Manson 
and colleagues’” recorded only three different patterns of fracture 
propagation on the frontal. None of these match the appearance of 
KNM-WT 71264. 

For KNM-WT 71257, Mirazon Lahr et al.! describe an impact to 
the left temporal causing concentric and radiating vault fractures, 
basicranial displacement, and a radiating fracture affecting the right 
side of the facial skeleton. Focal blunt force trauma to the cranial 
vault typically results in depression at the point of impact, producing 
localized in-bending of the vault with consequent out-bending in the 
surrounding area. As these processes progress, radiating fractures 
propagate outward from the point of impact and concentric fractures 
may form perpendicular to the radiating fractures, encircling the 
impact site. This configuration of fractures typically results in wedge- 
shaped pieces of bone” ** (Fig. 1f, g). This is not seen in the images 
provided by Mirazén Lahr et al.'. Rather, breakage of the left temporal 
region has a tessellated appearance resulting from a series of parallel 
vertically oriented fractures intersected at nearly right angles by several 
horizontally oriented fractures. These fracture lines have no single 
origin that would represent the site of impact. Note also that large 
portions of the temporal squama and the greater wing of the sphenoid 
appear to be intact. Given the thinness of these bones, it is unlikely 
that a direct blow impacted this region. Although Mirazon Lahr 
et al.' indicate that damage to this region was exacerbated by soil 
compression, the observable configuration of fracture lines is incon- 
sistent with what is expected to result from blunt force trauma to the 
temporal region. The same argument applies to the inferred blunt force 
trauma to the left temporal bone of KNM-WT 71264. 

Perimortem trauma is difficult to diagnose in ancient remains with 
complex taphonomic histories. Miraz6n Lahr et al.' base their inter- 
pretations on six criteria, but their assessment of these in the Nataruk 
skulls is open to question (see above), and several criteria may have 
limited utility in discriminating perimortem trauma from postmortem 
damage!*"4, We suggest soil compression is the primary cause of 
damage in KNM-WT 71253, 71264 and 71257. That the burials were 
found in lakebed sediments indicates they were submerged in antiquity 
and may have decomposed under conditions that increased bone 
malleability through mineral leaching’. This interpretation is supported 
by an association between in situ skull orientation and the location 
of primary cranial vault damage and contour disruption (Table 1). 
Individuals buried face down show primary damage in the antero- 
posterior direction, while individuals buried with their skulls on their 
side show primary damage in the medio-lateral direction. This is 
unlikely to be a coincidence. Although difficult to diagnose from the 


Table 1 | Body positioning and dating of the 12 in situ Nataruk 
burials 


Primary 
location 
Burial Body Skull Uranium 14C shell of vault 
number position orientation series date* age (cal BP) damage 
71251 prone facedown 12,200+700 10,275-10,190 Anterior 
71253 prone’ facedown 12,200+730 Anterior 
9,540+ 860 
71264 prone facedown 48,300+2900 10,480-10,470 Anterior, 
10,420-10,230 _ lateral 
71263 side/ partly Anterior? 
pronee down 
71265 partly Anterior 
down® 
71254 side onside 6,640 +460 9,290-9,030 Medio- 
lateral 
71257 — side on side 5,960+550 Medio— 
6,670+510 lateral 
71258 _ side 9,460-9,260 
71259 seated face 5,590+450 
down! 6,590+740 
71255 seated face up® 
71256 supine faceup 
71260 — side onside 


*Only uranium series dates that do not violate the diffusion-adsorption-decay model assumptions 
are included here. Data for both columns taken directly from the ‘Age’ column in extended data 
table 1 in Miraz6n Lahr et al.?. 

‘Torso and skull are prone with knees flexed to the chest. 

‘It is difficult to reconstruct the burial position of this individual based on the images provided in 
Miraz6n Lahr et al.!. Figure 2j in ref. 1 clearly shows a prone posture; however, this is difficult to 
reconcile with the composite image presented in Fig. 1 and extended data figure 7a in ref. 1. 
‘Fractures do not fit within scoring scheme as there is no real vault deformation, face is absent 
with primary inferred trauma to the frontal. 

®Mirazon Lahr et al.! describe this ‘as the face was partly turned down into the ground’. 

‘Mirazén Lahr et a/.! describe the skull orientation as face down, but also state the cranium was 
exposed and not recovered. 

®Miraz6n Lahr et al.! infer this based on the orientation of the cervical vertebrae. 


images provided, inferred postcranial damage is also consistent with 
taphonomic changes caused by wet-dry cycling, soil compression, and 
root, animal and insect activity; none are given serious consideration 
in a differential diagnosis by Mirazon Lahr et al.'. 

Furthermore, although interpreted conservatively as minimum 
age estimates, uranium series dates are associated with burial posture 
(prone versus non-prone). Importantly, the same pattern holds for 
radiocarbon dates based on shell (P=0.012), which are not subject 
to the same interpretive limitations. The oldest dates are recorded 
for prone burials and the youngest dates associate with individuals 
interred on their side (Table 1). This pattern suggests a multi- 
component chronology consistent with mortuary sequences in Sudan 
where multiple prone inhumations have been identified in attritional 
cemetery contexts). 

Finally, it is unlikely that the Nataruk skeletons were unburied and 
deposited simultaneously as suggested by Miraz6n Lahr et al.'. Their 
interpretation is based on three observations: (1) the lack of burial 
pits, (2) variation in body position, and (3) the association between 
fish bones and human remains. The absence of burial pits is common- 
place in mortuary sites throughout the world, especially of this age. 
Unlike well-documented massacre sites, the bodies at Nataruk are 
mostly articulated, spatially organized, non-commingled, and preserve 
limited variation in body positioning for which clear categories can be 
defined (see Table 1). Their posited non-burial scenario is not the only 
way for shell and fish bone to adhere to human bone within the post- 
depositional environment. The interpretation of Mirazén Lahr et al.' 
requires the decomposition of dozens of bodies at the edge of a lagoon, 
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without evidence of disarticulation or animal scavenging, long enough 
for them to be covered by lakebed sediments in an ephemeral and low 
energy depositional environment. This is unlikely, and in the case of 
the seated burial (71255) not possible. Interpersonal violence was surely 
present in early Holocene African hunter-gatherers, however, the case 
for a massacre at Nataruk is not supported by the data Mirazon Lahr 
et al.' report. 


Methods 

Data for Table 1 were extracted from Mirazon Lahr et al.'. We only include 
uranium series dates that did not violate the diffusion-adsorption-decay model 
assumptions. Damage location is based on the anatomical plane of the cranial vault 
that shows the most significant displacement in its contour. Assessments of blunt 
force cranial trauma are based on images and descriptions of fractures. Figure 1 was 
created using modern forensic cases from Maricopa County, Arizona as exemplars 
of typical patterning resulting from cranial blunt force traumata. t-test is based 
on the uncalibrated midpoint of radiocarbon dates using a two-sided test and the 
Satterthwaite approximation. 
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Mirazon Lahr et al. reply 


REPLYING TO C. M. Stojanowski et a/. Nature 539, http://dx.doi.org/10.1038/nature19778 (2016) 


In the accompanying Comment', Stojanowski et al. challenge 
the evidence for inter-group conflict at Nataruk”. They make two 
arguments—first, that the lesions in three crania are due to soil 
compression; second, that there is a correlation between body position 
and age, reflecting different burial traditions. We believe that their 
interpretation is incorrect on both counts. 

Stojanowski et al.' argue that the fractures in three Nataruk skulls do 
not meet forensic criteria, the criteria are unreliable, and the location 
of lesions is linked to skull/face orientation. We focus on cranium 
71264, argued by Stojanowski et al.‘ as an example of soil compression 
effects. This skull has two primary lesions (challenging any site-wide 
association between skull trauma location and face orientation), one on 
the left temporoparietal area, another on the frontal bone; these show 
clear in-bending of bone at impact-point and out-bending of affected 
regions, radiating fractures, directional change, depressed adhering 
bone fragments, lesions across sinuses, evulsion of molars, and 
differential lateral displacement of maxilla and mandible. We know of 
no selectively targeted, multi-directional soil compression agent that 
could account for the observed pattern in this, or other Nataruk crania. 
The criteria used for identification of perimortem trauma are not only 
consistent with the literature*, but clearly identify multiple traumatic 
lesions at Nataruk’, 
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Stojanowski et al.' also argue that prone individuals at Nataruk 
are older than the rest, and that the site must be a cemetery, with 
sequential burial traditions, to have preserved the remains. To support 
this, they (1) claim that there are four prone individuals at Nataruk, 
not two, by redefining the word ‘prone’ thus including two skeletons 
found flexed on their side’; (2) dismiss the absence of burial pits or the 
shells/fish adhering onto the bones as irrelevant; (3) do not account 
for the fact that bodies and body parts were strewn over 20,000 m?; (4) 
incorrectly state that the majority of human remains at Nataruk were 
articulated and uncommingled; (5) argue for ‘limited variation in body 
positioning’ (figure 1 from ref. 2 shows clearly the diversity of position/ 
direction of bodies, limbs, heads and faces at Nataruk); and (6) draw 
parallels with cemeteries in Sudan°. These are rich multi-stratified, 
younger sites (~8,000-2,000 years BP), in one of which earlier burials 
(with residual burial pits) were prone, later Neolithic ones, flexed?. 
However, prone skeletons in Sudan, although rare, are found across 
time—from 30,000-20,000 years ago at Wadi Kubbaniya‘® to as recent 
as Ist millennium Bc at Jebel Moya”, and preserved unburied bod- 
ies found before in East Africa’. In fact, there are no chronological, 
stratigraphic or internment parallels between the Omdurman sites and 
Nataruk, as there are no universal East African pre-Neolithic burial 
traditions. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


BRIEF COMMUNICATIONS ARISING 


Nataruk is not a cemetery, the majority of remains were disarticulated 
and dispersed; the human remains’ multiple cranial and post-cranial 
traumas are more consistent with perimortem traumatic lesions than 
soil compression, and critically include embedded projectiles. There is 
no association between body position and age, or face orientation and 
location of fractures. A case of inter-group conflict remains the best 
explanation of the events at Nataruk. 
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DNA REPAIR 


Clamping down on copy errors 


Repair enzymes must communicate across hundreds of nucleotides to undo errors made during DNA replication. Imaging 
reveals that the enzymes do this by forming a series of ring-like clamps that diffuse along the DNA. SEE LETTER P.583 


NEIL M. KAD & BENNETT VAN HOUTEN 


s we write up results for publication, 
A= spelling errors are quickly identi- 

fied by our computer's built-in spell- 
checker. However, this is not an invention of 
the modern world; nature has been using DNA 
spellcheckers for millions of years to avoid 
genetic errors that arise during DNA replica- 
tion, which it corrects through a process called 
mismatch repair. In the bacterium Escherichia 
coli, repair requires communication between 
enzymes across long stretches of DNA, and 
how this occurs has been hotly debated for 
decades’*. On page 583, Liu et al.* help to 
solve this mystery by using state-of-the-art 
techniques to analyse mismatch repair at the 
single-molecule level. 

DNA exists as a double-stranded duplex, 
connected across the strands by complemen- 
tary base-pairing — guanine (G) with cytosine 
(C) and adenine (A) with thymine (T). Dur- 
ing DNA replication, the duplex unwinds, 
and each strand is copied using new nucleo- 
tides to create a freshly synthesized daughter 
strand. Errors occur when incorrect bases are 
incorporated into the daughter strand, cre- 
ating a mismatch that requires subsequent 
repair to prevent mutations from arising. In 
E. coli, DNA with the base sequence GATC 
is normally tagged with a methyl group, but 
the newly synthesized DNA is temporar- 
ily unmethylated. These ‘hemi-methylated’ 
regions provide a means for mismatch-repair 
enzymes to distinguish between parent and 
daughter DNA. 


i MutS ® 
Mismatch 


Hemi-methylated sequences are also the 
sites at which the endonuclease enzyme MutH 
generates a single-strand break in the erro- 
neous DNA strand during mismatch repair. 
Because of the rarity of GATC sequences, 
MutH-generated breaks can occur hundreds of 
bases from the site of the mismatch. Following 
breakage, the damaged strand is excised by an 
exonuclease enzyme that works from the break 
to the error, and the DNA is resynthesized to 
incorporate the correct base. A key aspect of 
this process is the coordination of mismatch 
detection with loading of the DNA-excision 
machinery, to ensure that excision occurs 
towards, rather than away from, the mismatch. 

For the past decade, it has been thought? 6 
that the protein MutS, which recognizes 
DNA mismatches, acts as a sliding clamp — 
a ring around DNA that can efficiently move 
up or down the two strands, enabling com- 
munication between the mismatch and the 
hemi-methylated sites. A third protein, MutL, 
activates the cutting activity of MutH. But how 
MutS links mismatch detection to the activa- 
tion of MutH has been unclear. 

To study initiation of mismatch repair, Liu 
et al. added fluorescent tags to MutH, MutS 
and MutL and analysed the movements of 
the individual proteins along single DNA 
molecules. They found that MutL is loaded 
onto DNA through interactions with a MutS 
molecule that has already located a mismatch. 
This observation builds on a structural study’ 
showing that the DNA duplex is positioned 
in the open channel of the MutS ring follow- 
ing mismatch verification, and that MutL 


Figure 1 | Scouting for errors. a, Following DNA replication, the protein 
MutS clamps around DNA at mismatches — sites of incorrect pairing 
between the two complementary DNA strands. b, Liu et al.* report that, once 
a mismatch is detected, MutS recruits the protein MutL, and the two proteins 
slowly slide together up and down DNA. c, MutL in turn recruits the enzyme 


binding sites on MutS are revealed during 
this process, permitting formation of the 
MutS-MutL complex. 

Next, the authors confirmed another previ- 
ous observation’ — that the MutS—-MutL com- 
plex forms a single sliding clamp. Extending 
these observations, they showed that MutL can 
detach from MutS to act as a clamp on its own. 
In this role, MutL undertakes short excursions 
away from MutS. Surprisingly, some of these 
excursions pass behind MutS, an observation 
that Liu et al. took to imply that MutL might 
pass through the MutS channel. Therefore, 
MutL can be thought of as acting like a yo-yo, 
separately scouting the DNA. But scouting 
for what? 

The answer comes from the group’s 
demonstration that MutL can recruit MutH 
to the DNA, thereby setting the stage for 
the excision phase of mismatch repair. The 
MutL-MutH complex behaves similarly to 
MutL alone, detaching from MutS and diffus- 
ing along DNA more quickly than when the 
three proteins are bound together — presum- 
ably in search of hemi-methylated sites. The 
authors propose that the MutS—MutL-MutH 
complex is the dominant species, because the 
sliding-clamp nature of MutS means that this 
complex is tethered to the DNA. However, 
efficient searching is also possible with MutL- 
MutH, which Liu and colleagues posit hops 
along the DNA. 

Together, these observations might have 
solved the long-standing mystery of how 
the mismatch and excision sites interact, 
superseding previous models in which the 


d 
Fast 


c 
Slow Slow 
7 = =) —— = — 
So) MutL Rad 


498 | NATURE | VOL 539 | 24 NOVEMBER 2016 
© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


= =O 


MuthH, and the three move as one complex. d, MutL and MutH can detach 
from MutS to move rapidly along DNA in search of hemi-methylated sites 
(H) — sequences at which the parent strand is tagged with a methyl group but 
the mismatched daughter strand is not. The daughter strand is subsequently 
cleaved by MutH, initiating DNA excision and repair. 


DNA is looped through the MutS clamp 
or in which the clamp moves in a targeted 
manner along the duplex’*. The model that 
emerges from these data suggests that MutS 
acts as a guiding clamp, sending out scout- 
ing clamps to search for hemi-methylated 
GATC sequences in a highly energy-efficient 
manner (Fig. 1). 

In 2015, Paul Modrich received the Nobel 
Prize in Chemistry for his seminal contri- 
butions to our understanding of mismatch 
repair’. However, many aspects of the process 
remain unresolved. One is that mismatch 
repair in eukaryotic cells (those that have a 
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Polymers make 
charge flow easy 


nucleus) involves more-complex protein 
protein and protein-DNA interactions that 
have yet to be fully described. For instance, the 
eukaryotic equivalent of MutL has endonucle- 
ase activity, and eukaryotes use different mech- 
anisms for excision depending on whether 
the break point is 3’ or 5’ of the mismatch’”. 
Given this complexity, a better understand- 
ing of eukaryotic mismatch repair will surely 
present new mechanistic surprises. The use of 
single-molecule approaches to enable analysis 
of the eukaryotic process, already under way in 
several laboratories, will be central to elucida- 
ting these systems. 

The principles outlined by Liu et al. might 
apply to enzymes involved in other types of 
DNA repair. A recent report’ revealed that 
the yeast protein Rad4, which is involved in 
nucleotide-excision repair, does not directly 
bind damaged sites, but instead diffuses for up 
to 1 kilobase around the damage, providing a 
dynamic platform for the recruitment of other 
repair proteins. Furthermore, the protein Mfd, 
which dislodges RNA polymerase enzymes 
that become stalled while transcribing DNA, 
then scouts ahead for DNA damage on the 
transcribed strand”. 

The next chapter of the mismatch-repair 
story will surely see experiments that follow 
the entire process of repair, enabling observa- 
tion of every protein in real time in a single 
assay. With single-molecule fluorescence tech- 
niques developing rapidly, such experiments 
are in sight, and should provide a molecular 
understanding of biology in vitro that can be 
adapted to systems in vivo. m 
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Organic semiconductor devices require good electrical contacts with conducting 
materials, but such contacts are often inefficient. An approach that tackles this 
problem will enable a wide range of applications. SEE LETTER P.536 


ANTONIO FACCHETTI 


nconventional optoelectronic devices 

| such as thin-film transistors, solar cells 
and light-emitting diodes, in which 

organic semiconductors are key components, 
will potentially give rise to new display, circuit, 
lighting, imaging and energy-harvesting 
products’. All of these devices comprise a 
multilayer architecture in which a thin film 
of a conductor — typically, a metal or a trans- 
parent conducting oxide — is in contact with 
a semiconductor. To optimize the transfer of 


a High work function 
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electric charge between these two materials, 
an interfacial layer (IFL) is usually introduced, 
but the resulting architectures are often 
inefficient. On page 536, Tang et al.* show that 
IFLs created froma particular type of polymer 
can improve both the performance and the 
stability of optoelectronic devices. 

Several approaches have been developed to 
engineer IFLs, and these mainly depend on 
whether most of the charges that cross the con- 
ductor-semiconductor interface are electrons 
(negative charges) or holes (positive charges)”. 
For electron exchange, the energy needed to 


Conductor 


IFL 


Semiconductor 


Figure 1 | From chemistry to optoelectronics. Tang et al.* have developed a strategy to fabricate 
optoelectronic devices in which electrical charges can flow freely between the conductor and 
semiconductor components. a, To achieve this, the authors use organic semiconductors called 
self-compensated m-conjugated polymers as an interfacial layer (IFL) between the two materials. These 
polymers consist of ‘backbones’ to which positively charged holes (h*) can be added, referred to as hole 
doping. The charges of these holes are balanced by negatively charged ions (X-) that are chemically 
bonded to the polymer backbone by electrically insulating ‘linkers. Tang and colleagues show 

that hole doping such a polymer increases the energy that is needed to remove an electron from the 
conductor (the work function), allowing holes to be freely transferred through the IFL. b, Conversely, 
the authors show that by adding electrons (e°) to the polymer backbone — electron doping — the charges 
of the electrons are balanced by positively charged ions (Y*), and the system has a low work function, 
enabling electrons to be freely transferred through the IFL. 
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remove an electron from the conductor, called 
the work function, should be low and similar 
to that of the empty conduction energy band 
of the semiconductor. Conversely, for hole 
exchange, the work function should be high 
and match that of the semiconductor’s filled 
valence band. 

The most prominent families of materials 
used for IFLs include polyelectrolytes, metal 
oxides, self-assembled mono- or multilayers 
of organic molecules and mixtures of these®”. 
Each of these families has its strengths and 
limitations. However, using only one type of 
IFL family has made it difficult to secure use- 
ful properties, such as the ability to tune the 
work function over a broad range of energies, 
large conductivities, equally efficient hole and 
electron transfer and IFL stability. 

Tang and colleagues have constructed 
IFLs using organic semiconductors called 
m-conjugated polymers, which allow efficient 
inter- and intramolecular charge transport 
to occur between the conductor and semi- 
conductor materials*. Most m-conjugated 
polymers cannot transfer a large number of 
charged particles. However, the authors use 
a chemical doping process — the addition of 
either holes or electrons — to dramatically 
increase the density of charge carriers in the 
‘backbone’ of these polymers and, in turn, their 
electrical conductivity (Fig. 1). 

In conventional doped polymers, the charge 
of the delocalized carriers in the backbone is 
balanced by molecular counterions (ions that 
have the opposite charge to that of the carri- 
ers). However, Tang and collaborators’ poly- 
mers are self-compensated — the balance of 
charge is obtained intramolecularly owing 
to counterions that are chemically bonded to 
the polymer backbone by electrically insulat- 
ing linkers. Such polymers, particularly hole- 
doped ones, have long been studied’, and 
recent reports have extended the repertoire to 
electron-doped varieties’. The authors make 
the case for the unique properties of these 
polymers and their broad applicability in a vast 
array of optoelectronic devices. 

In Tang and colleagues’ polymers, doping 
and self-compensation are achieved in two 
separate steps, potentially improving control 
of the level of doping and the purity of the 
resulting materials. Furthermore, this con- 
trol can be achieved both in solution and in 
thin films, broadening the range of doping 
materials and polymer structures that could 
be used for IFL fabrication. The array of self- 
compensated polymers includes hole-doped 
triarylamine—fluorene copolymers that can 
provide ultra-high work functions (up to 
5.8 electronvolts) and electron-doped naphtha- 
lene-bis(carboxyimide)—fluorene copolymers 
that can have ultra-low work functions (down 
to 3.0 eV). Such a wide range of values allows 
the transfer of charge between the conductor 
and semiconductor materials to be optimized. 

In every multilayer optoelectronic device, 


chemical stability of the material components 
is of paramount importance for operational 
stability. This aspect is of even greater concern 
when using doped polymers, because ions can 
diffuse across the interfaces between layers, 
particularly in the presence of an electric field. 
However, Tang et al. show that, because the 
counterions in their self-compensated poly- 
mers are covalently bonded through the link- 
ers, the diffusion of charge carriers and ions 
between layers is prevented. This molecular 
design is essential for achieving a stable con- 
centration of the carriers (a constant doping 
density) across the entire thickness of the IFL 
film and across the interface between the 
IFL and the semiconductor. 

Tang and colleagues fabricate several devices 
in which standard IFL materials are replaced 
by their polymers, including thin-film tran- 
sistors, solar cells, light-emitting diodes and 
photodiodes. The authors realization of a blue- 
light-emitting diode is particularly impressive. 
Here, they use two self-compensated polymers 
(one electron-doped and one hole-doped) and 
a benchmark organic blue-light emitter (a 
polyfluorene) to achieve electric currents and 
energy-conversion efficiencies that surpass 
those of conventional IFLs using PEDOT:PSS 
and calcium — materials that have high and 
low work functions, respectively. 

The results demonstrate the great potential 
of self-compensated polymers, and raise 
several technological challenges and scien- 
tific questions. From a chemistry perspective, 
it will be instructive to study the synthetic 
reproducibility and scale-up feasibility of these 
polymers — to check whether the doping 
level and ionic content can still be controlled, 
and how charge transport and the work 


EVOLUTION 


function are affected. Furthermore, although 
the authors demonstrate the stability of their 
materials (and the work functions of these 
materials), they do not evaluate the stability of 
the devices incorporating their IFLs accord- 
ing to commercial standards. Such protocols 
are typically extremely demanding for use in 
commercial products. 

Finally, an essential but unanswered 
question is whether self-compensation can 
result in electron-conducting polymers that 
have electrical conductivities and stabilities 
at ambient conditions that match those of 
current benchmark doped polymers such as 
PEDOT:PSS or polyaniline. However, despite 
the above-mentioned challenges, Tang and 
colleagues’ work is a step on the path towards 
unconventional optoelectronics. m 
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Insect invasions and 
natural selection 


Observations of a real-time invasion of Australia by Asian honeybees 
demonstrate how natural selection can allow a small founding population to 
overcome the genetic odds stacked against success. 


AMRO ZAYED 


ocial insects that live in colonies rank 
among the most notorious biological 
invaders. They have colonized almost 
all terrestrial ecosystems, threatening native 
faunas, domesticated livestock and human 
health’. Yet the success of social-insect invaders 
is paradoxical, because they have a sex- 
determination system that gives rise to many 
sterile or inviable males in small founding 
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populations, which lack genetic diversity’. 
Writing in Nature Ecology & Evolution, Gloag 
et al.’ show how, during the early stages of an 
invasion of the Asian honeybee Apis cerana, 
the action of natural selection can lessen this 
sex-determination problem, allowing even the 
smallest founding populations to thrive. The 
authors also provide a powerful example of 
natural selection observed in real time during 
biological invasions. 

Sex determination in bees, like that in most 
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Figure 1 | Increased frequency of rare alleles by natural selection in an invasive bee population. 

A female bee has two copies of the sex-determining csd gene and mates with a male bee, which has only 
one copy of the gene. The csd gene exists in different forms called alleles, shown in different colours. 

a, Ifa female mates with a male that has an identical csd allele, half of the resulting offspring will have 
two identical csd alleles and will develop as sterile or inviable males that do not contribute to future 
generations. The offspring that have two different csd alleles develop as fertile females. b, All of the 
offspring from a mating between a male that has a rare allele, such as the one indicated in purple, anda 
female that does not have this rare allele will develop into fertile females. This would therefore increase 
the frequency of the rare allele in future generations, until all csd alleles have the same frequency. This 
type of natural selection — called balancing selection — was confirmed in observations made by Gloag 
etal.’ during a study of invading Apis cerana bees in Australia. 


ants and wasps, is governed by the genetic 
make-up ofa single, and usually highly vari- 
able, gene, called csd in honeybees’. Different 
versions of genes are called alleles. Haploid cells 
have one allele of each gene and diploid 
cells have two (these two alleles can be identi- 
cal or different). In bees, unfertilized haploid 
eggs have a single csd allele and develop into 
haploid male bees. Fertilized, diploid eggs 
that have two different alleles of csd develop as 
female bees, which are workers or queens in 
honeybee colonies. However, under conditions 
in which there is limited genetic variation at 
csd, for example when the population is small, 
fertilized eggs can have two identical alleles of 
csd, thereby tricking the sex-determination 
machinery into producing a diploid male bee 
from a female-destined egg. 

Diploid males are sterile or inviable in the 
vast majority of bee species studied so far’. In 
honeybees, diploid males are quickly cannibal- 
ized by workers’. By converting a fraction of 
female-destined eggs into sterile or inviable 
males, the production of diploid males slows 
the growth of bee colonies that require female 
workers to function. In small, isolated popu- 
lations, the production of diploid males 
can cause colony mortality® and population 
extinction’. How do invasive social insects 
overcome this potential drawback of their 
unusual system of sex determination? To 
address this question, Gloag et al. tracked 
honeybee genetics during a nascent invasion 
of Australia by A. cerana over an eight-year 
period. 

The authors speculated that a special form 


of natural selection, called balancing selec- 
tion, would have a role in reducing imbalances 
in the frequency of csd alleles. This would, in 
turn, reduce the production of inviable 
diploid males in the invasive population. With 
balancing selection, individuals that have rare 
csd alleles would be expected to have high 
fitness because they would be unlikely to mate 
with an individual that had the same allele —a 
mating that would result in the production of 
diploid male offspring (Fig. 1). Carriers of rare 
alleles should contribute more offspring to 
future generations and increase the frequency 
of their initially rare csd alleles over time. A 
prediction about balancing selection on csd is 
that an equilibrium would be reached when all 
csd alleles have the same frequency. 

Gloag et al. painstakingly sequenced the 
functional parts of the csd gene in the A. cerana 
population to determine how it evolved during 
the invasion of Australia. They also analysed 
genetic diversity at several other regions of the 
bee genome. The invasion was first detected 
in late 2007, and the authors monitored the 
genetics of the invasive A. cerana population 
from 2008 to 2015. Ecological theory predicts 
that invasions initiated by large or multiple 
founding populations would be the most 
likely to succeed’. But Gloag et al. found that 
the genetic diversity in the invasive A. cerana 
population was consistent with a single 
introduction of a single colony. 

In 2008, the authors identified only four 
different alleles at the csd gene, and they esti- 
mated that one out of four fertilized eggs would 
give rise to an inviable diploid male. However, 
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a year later, the authors documented three 
additional csd alleles. Gloag et al. propose that 
these three alleles were so rare that they had 
escaped detection in 2008. The authors showed 
that the rare alleles were not the result of new 
mutations because the alleles’ DNA sequence 
was substantially different from those of the 
alleles found in 2008. It is also unlikely that 
these new A. cerana csd alleles were introduced 
by a secondary invasion, because the authors 
did not observe new alleles at the other DNA 
regions they examined. Rather, the presence of 
these three rare alleles in 2009 is best explained 
by balancing selection, in which bees that car- 
ried the rare csd alleles in 2008 contributed 
more offspring to the next generation. 

Over the next six years, from 2009 to 2015, 
Gloag and colleagues found that the frequency 
of each of the seven csd alleles in the invasive 
population started to converge on an allele 
frequency of one-seventh, the theoretical 
frequency expected if balancing selection is 
assumed. By 2015, the authors estimated that 
one out of ten fertilized eggs developed into a 
diploid male, representing a 40% reduction in 
female mortality caused by diploid male pro- 
duction compared with the data from 2008 — 
this is a massive improvement in the fitness of 
the invasive population. Balancing selection at 
the csd gene has been indirectly inferred from 
studies of DNA-sequence change’ in native 
honeybee populations, and Gloag and col- 
leagues’ work provides a direct observation of 
this form of selection in real time. 

Diploid males are typically found in inva- 
sive populations of ants’, bees'’ and wasps”. 
The balancing selection at the csd gene docu- 
mented by Gloag et al. might be a common 
process that enhances the success of social- 
insect invasions by correcting imbalances in 
the frequency of csd alleles that occur during 
founding events. 

Although balancing selection clearly 
increased the fitness of invasive Asian honey- 
bees in Australia, it is not clear whether this 
evolutionary force was essential for the suc- 
cessful establishment of the invasion. The inva- 
sive population of A. cerana still increased in 
size despite the skewed allele frequencies at the 
csd gene during the early stages of the inva- 
sion. Social-insect invaders might have other 
attributes that predispose them to be success- 
ful biological invaders. The intrinsic growth 
rates of some social insects might be so high 
that even a 25% increase in female mortal- 
ity — the amount estimated by Gloag and 
colleagues to occur in the initial stages of 
A. cerana’s invasion — did not prevent 
population expansion. 

Queen honeybees typically mate with 
10-20 males, which spreads the consequences 
of diploid male production evenly across the 
invasive population; instead of some colo- 
nies producing a lot of diploid males, with 
others producing none, most colonies in the 
population would be expected to produce a 


moderate level of diploid males'’. Moreover, 
honeybee colonies recoup some of the costs 
of producing diploid males when they are 
cannibalized by workers. Perhaps the reduced 
genetic diversity found in invasive A. cerana 
reduces aggression between colonies as it does 
in invasive Argentine ants!. Nevertheless, 
Gloag and colleagues’ study provides a clear 
example of how rapid evolutionary changes 
can affect the fitness of invasive populations. m 
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Flexible graphene 
strengthens friction 


Previous observations showed that friction on graphene increases gradually 
when a probe starts to slide across the material’s surface. Simulations now reveal 
that this effect is related to bending of the graphene sheet. SEE LETTER P.541 


ASTRID S. DE WIJN 


s anyone who has slipped on wet 

autumn leaves might realize, friction 

in layered materials can behave in 
peculiar ways. Such friction can be studied on 
small scales by looking at graphene, a material 
that consists of single sheets of carbon atoms. 
Experiments’ performed a few years ago 
using atomic force microscopy (AFM) revealed 
that friction is larger when the number of 
stacked graphene sheets is smaller. Moreover, 
when a probe begins to move across the surface 
of stacked graphene sheets, friction initially 
increases and then levels off’. A wide range 
of suggestions has been proposed to explain 
these observations, but on page 541, Li et al. 
solve the mystery using computer simulations. 
They report that graphene’ strange behaviour 
is related to distortions that strengthen friction 
on thin stacks of sheets. 

Friction is so common in our daily lives that 
we barely think about it. Humans have been 
forced to deal with it since prehistoric times: 
the wheel is essentially a device for reducing 
friction when moving heavy loads, and rub- 
bing sticks together to make fire makes use of 
the fact that friction generates heat. We have 
therefore developed a massive amount of phe- 
nomenological knowledge of friction under 
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specific circumstances, and especially of how 
to reduce it. But our fundamental understand- 
ing of how and why friction works is still sorely 
lacking. 

Layered materials are a prime example of 
this knowledge gap. Some of them, such as 
graphite — stacked layers of graphene — and 
molybdenum disulfide are commonly used in 
powdered form as solid lubricants or as addi- 
tives in liquid lubricants, and sometimes as 
low-friction coatings. We use these materials 
all the time, but are only now starting to under- 
stand their workings. 

Graphene has received much attention in the 
past few years (see ref. 4, for example), largely 
because of its remarkable electronic proper- 
ties — research into this material was rewarded 
with the Nobel Prize in Physics in 2010. But 
it is graphene’s mechanical properties that 
are interesting when considering its friction. 
Most importantly for Li and colleagues’ results, 
single sheets of layered materials bend easily. 
Although it might seem obvious that such 
bending must have a role in the friction of thin 
stacks of graphene, many explanations for the 
role of bending were possible. 

To understand Li and co-workers’ results, 
we need to know what surfaces in contact look 
like. Real surfaces are rough on microscopic 
scales, even if they look flat to the naked eye. 


rw A Graphene 


ey 
Area of contact 


Figure 1 | Friction on graphene sheets. a, Surfaces 
that look smooth on a macroscopic scale are 

rough on the microscopic scale. Contact between 
surfaces thus occurs only between microscopic 
projections from the surfaces, and friction between 
the surfaces depends on the strength of the 
interactions between the contacting projections. 

b, Li et al. performed simulations to investigate 
what happens when a nanometre-scale probe is 
dragged along the surface of graphene on a silicon 
substrate. During the initial stages of sliding, 

the strength of the probe's interaction with the 
graphene increases gradually, thus increasing the 
friction between the probe and the graphene. The 
substrate-graphene interaction (blue) also plays a 
part in the observed effects, although its exact role 
is unclear. 


Two surfaces in contact therefore never fit 
perfectly against each other, but meet only 
where microscopic projections come into con- 
tact (Fig. la). The actual area in contact is thus 
typically many orders of magnitude smaller 
than the apparent contact area. 

These microscopic projections play a central 
part in friction. More specifically, the strength 
of the interaction between the microscopic 
contacts, which Lief al. refer to as the quality 
of the contact, is a key factor in the dissipation 
of the kinetic energy of sliding. The interaction 
strength can be affected by many factors, such 
as a material's properties, its chemistry and a 
mismatch of the crystal lattices in the contact- 
ing surfaces’, but also, in the case of graphene, 
by bending*. 

The previously reported AFM experi- 
ments’* measured what happens at a single 
contact on stacks of graphene. Li et al. simu- 
lated those experiments numerically and 
investigated the strength of the interaction at 
a single contact. In the simulations, they put 


one or several layers of graphene on top of a 
substrate (silicon) and rubbed the surface of 
the material with a tiny probe. They observed 
that bending distortions of the graphene 
sheets under the probe gradually increase as 
the probe moves; this increases the contact 
strength, and thereby also the friction (Fig. 1b). 
The effect is bigger if there are fewer layers, 
because thin stacks of graphene are more flex- 
ible than thicker stacks. The authors also find 
that the substrate underneath the graphene 
contributes to the increased contact strength. 
It is worth noting that the authors checked 
carefully to make sure that the effect observed 
in their simulations was not related to the 
sliding speed. This was crucial, because 
current limitations to computational power 
make it impossible to simulate sliding at the 
low speeds used in AFM experiments. 
Despite the fresh insight provided by Lietal., 
many questions remain. What exactly causes 
the bending distortions to grow? And what is 
the role of the substrate under the graphene? 
When the authors altered the parameters of 
their model to increase the binding of the 
substrate to graphene, the contact strength no 
longer changed as the probe moved. The same 
was true when there was no substrate at all. 
Moreover, both the substrate and the probe 
used by the authors were amorphous materi- 
als that lacked any structural order. It would 


CELL BIOLOGY 


NEWS & VIEWS | RESEARCH | 


be interesting to see what happens if the 
simulations are repeated using an ordered sub- 
strate and/or probe. Graphene is often studied 
on ordered substrates in other contexts, and 
such a study might also provide insight into the 
mechanisms at play if the structures of the sub- 
strate and tip affect strengthening in some way. 
Friction is a messy problem to study, because 
its effects are often found in highly complex 
systems that involve a variety of materials and 
chemistry. It is usually not clear which of these 
are essential for a specific friction effect, and 
which have little or no role. This makes mean- 
ingful study of such phenomena difficult. Li 
and colleagues’ finding of increases in contact 
strength ina relatively simple system opens up 
the possibility of studying the strengthening 
process in a much more controlled way. m 
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A mitochondrial brake 
on vascular repair 


Injured blood vessels are repaired by vascular smooth-muscle cells. It emerges 
that the protein Fatl regulates the proliferation of these cells by inhibiting the 
function of mitochondria. SEE LETTER P.575 


CHARLES E. DE BOCK & RICK F. THORNE 


fter injury to the cells that line blood- 
A= walls, vascular smooth-muscle 

cells (VSMCs) move to the injured 
region and proliferate to cover the damaged 
site. However, cellular overproliferation at the 
repair site might cause vessel-wall thickening 
that reduces blood flow through the vessel. 
In addition to naturally occurring damage, 
blood-vessel injury can be a consequence of 
surgical interventions such as transplants or 
procedures to open narrowed blood vessels. 
An understanding of the signals that regulate 
VSMC proliferation might enable the develop- 
ment of clinical approaches to limit this pro- 
cess and prevent blood-vessel narrowing. Cao 
et. al.' demonstrate on page 575 that the Fatl 
receptor protein, a negative regulator of V3MC 


proliferation, has a direct and unexpected role 
in regulating energy production in mito- 
chondria, the organelles that act as cellular 
powerhouses. 

Fatl is a member of the vertebrate Fat 
cadherins, a small family of proteins whose 
primary function is unclear’. Previous work’ 
revealed that Fatl enhances the migration and 
limits the proliferation of VSMCs, providing 
clues that Fat1 could have a role in vascular-cell 
remodelling. 

The Fatl receptor is normally present 
at the cell surface, and its structure con- 
sists of an extracellular domain, a trans- 
membrane segment and an intracellular 
domain’. Cao et al. used mass spectrom- 
etry to identify proteins that interact with the 
intracellular domain of mouse Fat1. They dis- 
covered that 22 Fat] interactors are proteins of 
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50 Years Ago 


Recent reports on the occurrence 
of “pink spot” in the urines of 
schizophrenics are somewhat 
conflicting and cast doubt on 

its chemical identity and its 
relationship to schizophrenia... 
The possibility that “pink spot” 
may be associated with brain 
damage induced by drugs is 
supported by the fact that it is also 
found in high concentration in 
the urines of patients suffering 
from Parkinsonism, a disease in 
which the brain is indisputably 
damaged. This leaves open the 
questions of the relationship of 
“pink spot” to schizophrenia and 
also its relationship to dopamine 
metabolism ... it would be 
interesting to know whether any 
of the patients who have been 
reported in the literature as “pink 
spot” excreters also showed clinical 
signs of brain damage in the form 
of dyskinesia induced by drugs at 
the time of investigation or whether 
they have developed it since the 
reports were published. 

From Nature 26 November 1966 


100 Years Ago 


Whatever difference of opinion 
may exist in regard to the stimulus 
which has moved the Government 
to take control of our food supplies, 
all are agreed that it has not come 
about a moment too soon ... The 
provisions outlined will empower 
the Food Controller to take 
measures both preventive and 
regulatory ... But Government 
measures cannot stop with the mere 
regulation of food supplies. Powers 
must be taken to compel a greater 
production of home food and to 
ensure a larger acreage of wheat ... 
Objection may be raised to the 
shortage of labour ... why should 
prisoners of war not be utilised to 
the fullest degree possible in raising 
the home production of food? 
From Nature 23 November 1916 


Blood vessel 


Smooth- 
muscle cell 


Mitochondrion 


proliferation 


Figure 1 | Fat1 goes with the flow. Injury to the walls of blood vessels can occur through natural causes 
or surgical interventions. Vascular smooth-muscle cells (VSMCs) migrate to the damaged site and 
proliferate as part of the healing response. However, this can result in cellular overproliferation, causing 
vessel-wall thickening and narrowing the blood vessel. The Fat1 receptor protein (blue) limits the 
proliferation of VSMCs’. In studies using mice and human cells, Cao et al. report that a portion of Fat1 
interacts with proteins in the mitochondria and inhibits mitochondrial activity in VSMCs. 


the inner mitochondrial membrane. 

Many biochemists would be cautious about 
further investigating any such interactions 
with mitochondrial proteins because non- 
specific interactions with these proteins are 
common. Nonetheless, Cao et al. tested and 
validated their findings using a variety of 
techniques. The authors observed that Fat1 
is present inside the mitochondria of VSMCs. 
They also found that non-full-length portions 
of Fat1 are enriched in mitochondria, and that 
these portions interact directly with proteins 
of the mitochondrial respiratory machinery 
that generate the energy-carrying nucleotide 
ATP. Subsequent analyses revealed that mouse 
VSMCs lacking Fat] had enhanced mitochon- 
drial respiration in the absence of other altera- 
tions to mitochondrial structure or mass that 
might be responsible for such enhancement. 
Cao and colleagues conclude that Fat1 damp- 
ens mitochondrial respiration to suppress 
VSMC proliferation (Fig. 1). 

The importance of mitochondrial function 
to the proliferative capacity of VSMCs was 
already known*. But how central is Fat] sig- 
nalling to the mitochondrial regulation needed 
to enable VSMC repair of vascular damage? 
Cao et al. established the relevance of this 
pathway in human tissue through in vitro studies 
demonstrating that human FAT1 can regulate 
mitochondrial respiration and proliferation 
in human VSMCs from arterial blood vessels. 
The authors also observed FAT1 expression 
in human VSMCs in an artery-repair context, 
when they tested post-mortem artery samples 
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from people who had undergone surgical 
insertion of a stent device to correct athero- 
sclerosis, an artery-narrowing condition. 

Cao and colleagues investigated a model 
of arterial injury using mice whose smooth- 
muscle cells lacked the Fat1 gene. Injured 
arteries in these mice became significantly nar- 
rower than did those of control mice, establish- 
ing that the presence of Fat] normally limits 
the potential of VSMCs to contribute to vas- 
cular blockage. However, although this model 
system might be relevant to many aspects of 
vascular-cell proliferation and remodelling, 
it does not reflect all the processes involved 
in vascular disease. Nevertheless, this work 
establishes a framework with which to fur- 
ther investigate the Fatl and mitochondrial 
signalling pathway — for example, by testing 
whether this mitochondrial-control pathway 
is evolutionarily conserved in other members 
of the Fat protein family. 

Vascular biology is not the only relevant 
setting for Fat1 investigation. Roles for Fat1 
have been proposed in a surprisingly diverse 
range of disease states’”. Cancer is of particu- 
lar interest, given that studies” in the fruit fly 
Drosophila melanogaster indicate that Ft, a 
fly version of Fat1, is a tumour-suppressor 
gene. A tumour-suppressor function is con- 
sistent with reports of repression or inactiva- 
tion of FAT] in certain human cancers”* — for 
example, loss of FAT1 can result in tumour- 
promoting Wnt-protein signalling®. 

However, FAT1 is overexpressed in some 
types of cancer’, creating a conundrum about 


whether this protein promotes or inhibits 
tumours. FAT1 might be overexpressed in 
cancer because tumour cells need to escape 
destruction by cell-death processes, and high 
levels of FAT 1 protect against cell death’. FAT1 
is also upregulated in low-oxygen (hypoxic) 
conditions’. The stressful cellular environ- 
ment that results from hypoxia often occurs 
in solid tumours, and perhaps FAT1 upregu- 
lation allows tumour cells to survive in such 
conditions. As is so often the case in biology, 
the determination ofa protein's action requires 
observations to be placed in the correct cellular 
context. The challenge is to discover whether 
Fat1’s function in controlling mitochondrial 
respiration is relevant to human cancer. 
Many questions remain. Previous work"” 
shows that D. melanogaster Ft enhances 
mitochondrial function, which contrasts with 
Cao and colleagues’ observations that Fat] 
inhibits mitochondrial function in the mam- 
malian systems they studied. Whether this dif- 
ference results from evolutionary divergence or 
reflects a cell-dependent context is unknown. 
Discovering how Fat is regulated is a 
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key area for further investigation. Cao et al. 
observed that smaller portions of Fat1, rather 
than the full-length protein, are associated with 
the mitochondria. This observation suggests 
that Fat] undergoes enzyme-mediated cleavage 
to enter the mitochondria. Fat1 is a substrate 
of the y-secretase protein complex’? — the 
protease enzyme that cleaves and releases cyto- 
plasmic domains of transmembrane proteins” 
— although the authors did not investigate Fat] 
cleavage by this or any other enzyme. 

It is also not known whether a ligand 
molecule binds the extracellular domain of 
Fat1. Given that endothelial cells in blood- 
vessel walls sense mechanical damage and 
relay this information to adjacent VSMCs", 
endothelial cells might signal to Fat1 by 
an unknown ligand. An understanding of 
such aspects of FAT1 regulation will prob- 
ably be needed for the therapeutic potential 
of this signalling pathway to be exploited in 
the clinic. m 


Charles E. de Bock is at the VIB Center for 
the Biology of Disease, Leuven 3000, Belgium, 


Improving the image 
of nanoparticles 


A biocompatible probe that combines fluorescent nanodiamonds and gold 
nanoparticles allows cells to be imaged using both optical and electron 
microscopy techniques, opening up fresh opportunities for biological research. 


CHRISTOPHER S. WOOD 
& MOLLY M. STEVENS 


iamonds are renowned for their 
D sparkle, but over the past few years 
many scientists have become interested 
in nanometre-scale fragments of diamonds 
that fluoresce. Such nanodiamonds are 
attractive probes for fluorescent cell imaging" 
because they fluoresce intensely, show low tox- 
icity to cells and are stable to chemicals and 
light. However, they cannot easily be tracked 
using electron microscopy, limiting their utility 
as biological probes. Writing in Nano Letters, 
Liuet al.” report that they have combined fluo- 
rescent nanodiamonds with gold nanoparti- 
cles to create ‘all-in-one’ hybrid particles that 
can be used in both fluorescent imaging and 
electron microscopy. This potentially opens 
the way to studying cellular processes such as 
endocytosis — a membrane-dependent pro- 
cess that delivers materials to cells — ina single 
sample at multiple resolutions. 
The use of nanoparticle-based probes is the 
latest in a long line of developments in imaging 


technology. Although electron microscopy 
can resolve structures smaller than is achiev- 
able using light, the samples studied must 
be fixed or frozen, providing limited insight 
into the dynamic processes of living systems. 
Super-resolution microscopy techniques’ 
based on fluorescent imaging overcome this 
limitation and allow the visualization of bio- 
logical processes at work, even at the molecular 
level*’. However, these techniques are limited 
by the brightness of the required fluorophores 
(fluorescent proteins or small molecules), and 
by the light and chemical stability of those 
molecules. 

Researchers are increasingly turning to 
nanoparticle-based imaging probes to over- 
come these obstacles. Matter on the nanoscale 
can give rise to phenomena such as intense 
fluorescence, which minimizes some of 
the drawbacks associated with molecular 
fluorophores. For example, fluorescent nano- 
diamonds (FNDs) contain lattice defects 
known as single nitrogen-vacancy centres, 
which fluoresce at wavelengths of about 
700 nanometres when excited by a laser. 
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Nanoparticles also provide additional modes 
of imaging and methods for probing their local 
environments — such as surface-enhanced 
Raman scattering® or nanoscale magneto- 
metry’ (the detection of magnetic fields at 
nanoscale resolution). However, these abilities 
are rarely combined in a single particle. 

Gold nanoparticles (AuNPs) are extensively 
used in biosensing and bioimaging because 
they enhance the contrast of structures in elec- 
tron microscopy. Many sizes of AuNPs quench 
the excited states of fluorescent molecules, an 
effect that has been exploited in numerous 
highly sensitive biosensors®. This otherwise 
useful property limits our ability to track 
AuNPs in living systems using fluorescence 
microscopy. But in Liu and co-workers’ hybrid 
nanoparticles, FNDs fluoresce even though 
they are in close proximity to similarly sized 
AuNP contrast agents. AuNPs and FNDs have 
been combined before’, but the current work 
demonstrates the potential of hybrid-parti- 
cle constructs to provide multiple modes of 
imaging or sensing at a single point ina sample. 

Developing complex biocompatible probes 
and biosensors from multiple nanoscale com- 
ponents is challenging. The difficulty arises 
partly because nanoparticles tend to form 
aggregates, especially in biological environ- 
ments, but also because it is hard to form and 
purify stable particle constructs that contain 
a defined number of components. To over- 
come these problems, Liu et al. associated 
the AuNPs with the FNDs by mixing them 
with the protein human serum albumin 
(which interacts with FNDs and AuNPs), and 
stabilizing the resulting complexes by forming 
covalent crosslinks between the AuNPs and 
the protein and between the protein and the 
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Figure 1 | Preparation of hybrid nanoparticles for biological imaging 
studies. Liu et al.’ have made nanoparticles that contain fluorescent 
nanodiamonds (FNDs) and gold nanoparticles (AuNPs). a, They began 
by mixing AuNPs (decorated with organic molecules to prevent their 
aggregation) with FNDs anda protein (human serum albumin). This 
generated hybrid particles composed of an FND with protein molecules 
adsorbed to its surface, along with one or two AuNPs. b, The authors then 


FNDs (Fig. 1). The authors then purified the 
desired products by ultracentrifugation, and 
coated them with a positively charged polymer 
to further stabilize them and to enhance cell 
uptake. The resulting particles contained one 
or two AuNPs per FND, were readily absorbed 
by cells and showed low cytotoxicity. 

Intriguingly, covalently linking the AuNPs 
to the FND led to an improvement in FND 
fluorescence and a reduction in the lifetime 
of excited states in the FNDs, both of which 
occur as a result of coupling between the 
nitrogen-vacancy centres and AuNP surface 
plasmons (collective oscillations of delocal- 
ized electrons). The authors also attribute the 
increased fluorescence to concentration of the 
exciting laser pulses by the AuNPs, leading to 
more-efficient laser excitation of the hybrid 
particles — although they do not discuss a 
precise mechanism for this. 

The enhanced fluorescence is particularly 
interesting because it might improve the 
signal-to-noise ratio of FNDs in sensing and 
imaging based on nanoscale magnetometry’. 
The nitrogen-vacancy centres found in FNDs 
are particularly sensitive to external magnetic 
fields, and so nanoscale magnetometry using 
FNDs is emerging as a potentially useful sens- 
ing technique in biological environments”. It 
could enable the detection of single electrons, 
for example, and thus provide insights into 
biological processes, such as respiration, that 
involve electron transfer or the recombination 
of free radicals”. 

The development of Liu and colleagues’ 
biocompatible hybrid-nanoparticle probes is 
an important step towards improved imag- 
ing of cell mechanisms, and opens up fresh 
possibilities for sensing within cells. For 
example, the authors observed both the uptake 
of FND-AuNPs by cells using optical micro- 
scopy, and the release of the nanoparticles 
from cellular vesicles called endosomes into 
the cytosol using transmission electron micro- 
scopy. However, it remains to be seen whether 
the hybrid nanoparticles can be used in vivo. It 
is also unclear whether the authors’ approach 
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for attaching AuNPs to FNDs can be used to 
attach other nanometre-sized structures, such 
as antibodies, enzymes and further types of 
nanoparticle. 

A robust, modular method for synthesizing 
hybrid nanoparticles in general — not just 
those involving FNDs — is now needed to 
allow scientists to rapidly combine different 
imaging probes and biosensors without exten- 
sive optimization. This would enable biological 
environments to be probed in many ways at 
specific points within a sample. Such a method 
would also enable nanoparticle-based drug- 
delivery systems to be incorporated easily 
into hybrid probes. This could lead to rapid 
prototyping of ‘theranostic’ agents, in which 
particles deliver therapeutic compounds to 
patients and report the effects using several 
forms of information. = 
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stabilized the hybrid particles by forming covalent crosslinks between the 
organic groups on the AuNPs and the proteins, and between the proteins 
and the FNDs. c, Finally, the hybrid particles were coated with a positively 
charged polymer, enabling them to be taken up efficiently by cells. The FND 
component allows the particles to be tracked in cells using fluorescence 
imaging, whereas the AuNP components enable the particles to be imaged 
using electron microscopy. (Adapted from ref. 2.) 
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A war over water when 
bacteria invade leaves 


Plants and bacteria battle for control of water during leaf infection, as is 
demonstrated by a bacterial species that manipulates plant cells to create a 
water-rich environment that promotes bacterial growth. SEE ARTICLE P. 524 


GWYN A. BEATTIE 


a plant’s defence mechanisms by inject- 
ing what are known as effector proteins 
directly into plant cells. In addition to the 
plant’s defence mechanisms, invading bacteria 
might encounter another obstacle — a severe 
shortage of water in the leaf intercellular space 


[snes bacterial leaf pathogens disarm 
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(apoplast) that limits bacterial growth. On 
page 524, Xin et al.' report that Pseudomonas 
syringae bacteria inject effector proteins that 
specifically target plant-cell processes to pro- 
mote apoplast hydration, thereby creating an 
environment suitable for bacterial growth. 
High humidity favours the development of 
bacterial leaf diseases that are associated with 
the formation of leaf spots. An early symptom 


of leaf-spot diseases is water soaking’, a local- 
ized abundance of water in the apoplast. Exten- 
sive water soaking of plants is associated with 
epidemics and severe disease symptoms’. 

To investigate the relationship between the 
water soaking of leaves and bacterial growth, 
Xin and colleagues used microscopy to ana- 
lyse the thale cress plant Arabidopsis thaliana. 
They observed that water soaking occurred 
only transiently during the start of leaf-spot 
development. The leaf sites that exhibited this 
transient water soaking were also the sites 
of early bacterial replication, and ultimately 
became the locations of disease lesions. These 
results provide direct visual evidence that links 
water soaking to bacterial growth and leaf-spot 
development. 

The WstE and AvrE effector-protein family 
can promote water soaking in some plant 
species*”, although the mechanism involved 
and the role of water soaking in plant infection 
had not been identified for these proteins. Xin 
et al. found that the P syringae effector proteins 
AvrE and HopMI1 can each promote water 
soaking in A. thaliana, and that this soaking 
occurs regardless of whether the proteins are 
expressed by the bacteria or expressed in the 
plant cell by genetic engineering. The authors 
demonstrated that HopM1 induces soaking 
though a pathway that involves degradation 
of the plant protein MIN7. This discovery 
indicates that a bacterial pathogen can actively 
promote hydration of the apoplast by modify- 
ing a plant-cell target. 

How water accumulates in the apoplast 
after MIN7 is destroyed is not known. Possi- 
ble mechanisms include changes in cell-mem- 
brane function and stability, changes to the 
aquaporins — water-transport proteins — in 
cell membranes, or alterations to ion-transport 
activity across the membrane. A role for the 
cell membrane as a target for pathogen-medi- 
ated effects on water movement is supported 
by the facts that MIN7 is involved in maintain- 
ing a normal cell membrane’ and that AvrE 
moves to the cell membrane after injection’. 

Xin et al. found that only two plant functions 
must be altered to enable bacterial infection: 
a plant immune-defence system known as 
pattern-triggered immunity and water limita- 
tion in the apoplast. A plant mutant lacking 
both MIN7 and pattern-triggered immunity 
enabled strong growth, not only of a non- 
pathogenic P syringae strain, but also of other 
bacterial communities normally resident at 
low levels on leaves. The growth of these resi- 
dent communities in the apoplast correlated 
with the appearance of leaf yellowing and 
small regions of plant-cell death, revealing 
that even bacteria not known to be pathogenic 
can exact a detectable toll on plant health if the 
conditions arise to enable excessive bacterial 
proliferation. 

The humidity in the air outside the leaf 
can influence a plant’s ability to maintain a 
water-limited apoplast, because this ambient 
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Figure 1 | A bacterial pathogen can affect the hydration around leaf cells. a, Bacteria need water to 
grow, and plants minimize bacterial growth in leaves by limiting water in the space between leaf cells 
(the apoplast). Pathogenic bacteria such as Pseudomonas syringae inject a variety of proteins directly into 
plant cells to aid invasion. These proteins include inhibitors of the plant immune system. Xin et al.’ report 
that two injected bacterial proteins, HopM1 and AvrE, can each induce apoplast hydration. The authors 
demonstrate that HopM1 degrades MIN7, a plant protein that has a key role in limiting fluid loss from 
plant cells. However, under conditions of low ambient humidity, the bacterial injection of HopM1 and 
AvrE may induce only low levels of apoplast hydration. The mechanism by which AvrE induces apoplast 
hydration is unknown, but might involve targeting of a host protein complex". b, Under conditions 

of high ambient humidity, HopM1 and AvrE induce high levels of apoplast hydration known as water 
soaking. Water soaking in the apoplast is essential for bacterial proliferation and for the development of 


symptoms of plant disease. 


air connects directly to the apoplast through 
pores called stomata. Low-humidity air should 
help to limit the water in the apoplast and aid 
in antibacterial defence. This is consistent with 
the low levels of leaf-spot disease observed in 
arid conditions. By contrast, continuous water 
soaking should negate a defence strategy 
of maintaining low apoplast hydration, and 
this is consistent with the abundant growth 
of non-pathogenic bacteria in the apoplast of 
persistently water-soaked leaves”. 

Xin and colleagues tested different ambient 
humidity conditions, and demonstrated that 
the humidity must be high to enable the patho- 
gen to induce water soaking that is dependent 
on the injection of AvrE or HopM1 (Fig. 1). 
This strong requirement for humidity reflects 
a fundamental tenet of plant pathology: 
disease depends on the host, the pathogen and 
the environment. 

Plants and pathogens battle for water during 
the initial pathogenic attack, and also later, 
after the plants have been infected. Xin and 
colleagues found that MIN7 is stabilized by 
a type of plant immune response known as 
effector-triggered immunity. This indicates 
that the plants have evolved mechanisms to 
counteract the bacterially induced destruction 
of MIN7 that enables apoplast hydration. The 
ability of plants to affect such hydration is 
consistent with the low apoplast hydration 
observed in A. thaliana plants that can resist an 
attack by P. syringae®. Reduced fluid movement 
into the apoplast during effector-triggered 


resistance of A. thaliana to P. syringae attack 
suggests that, as well as preventing apoplast 
hydration, plants might also actively promote 
drying of the apoplast’. 

The authors’ study supports a model in 
which plants maintain a water-limited apoplast 
as a barrier against bacterial growth, and in 
which pathogen effectors disarm this barrier, 
leading to water soaking in conditions of high 
humidity. A key next step in understanding 
this fascinating plant-pathogen battle for con- 
trol of water will be to identify the nature of the 
‘flood gates’ to the apoplast and the molecular 
mechanisms by which pathogens open them. m 
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Emergent phenomena induced by spin- 
orbit coupling at surfaces and interfaces 


Anjan Soumyanarayanan!”, Nicolas Reyren®, Albert Fert? & Christos Panagopoulos! 


Spin- orbit coupling (SOC) describes the relativistic interaction between the spin and momentum degrees of freedom 
of electrons, and is central to the rich phenomena observed in condensed matter systems. In recent years, new phases 
of matter have emerged from the interplay between SOC and low dimensionality, such as chiral spin textures and spin- 
polarized surface and interface states. These low-dimensional SOC-based realizations are typically robust and can 
be exploited at room temperature. Here we discuss SOC as a means of producing such fundamentally new physical 
phenomena in thin films and heterostructures. We put into context the technological promise of these material classes 
for developing spin-based device applications at room temperature. 


he electric field experienced by a travelling electron translates, in 

its rest frame, to a magnetic field proportional to its velocity— 

a relativistic effect which is notable in crystalline lattices with 
heavy atoms. The Zeeman interaction between the electron spin and 
this effective magnetic field is equivalent to the coupling of the elec- 
tronic spin and momentum degrees of freedom, known as SOC. SOC 
can split degenerate bands with finite angular momentum (p, d and f), 
modifying the electronic band structure. Importantly, SOC effects 
are greatly enhanced in reduced dimensions (Fig. 1, left and right). 
First, inversion symmetry is broken at the surface or interface, and the 
resultant electric field couples to the spin of itinerant electrons. This 
phenomenon, known as Rashba SOC’, produces spin-split disper- 
sion even at the surfaces of conventional metals (such as Au and Bi)?. 
Recently discovered topological insulators, have spin-polarized surface 
states with additional topological properties. In both these cases, strong 
two-dimensional (2D) SOC locks the electron spin and momentum. 

Spin-momentum locking in 2D geometries has direct consequences 
for the interplay between the charge and spin transport (Fig. 1, top 
left). An in-plane charge current induces a transverse spin accumula- 
tion (uniform non-zero spin density). This spin accumulation can be 
used to eject a spin current into an adjacent layer (Edelstein effect). 
Conversely, the injection of a spin current induces the associated spin 
polarization and charge current in the 2D states. Other types of con- 
version between charge and spin currents can also be obtained by SOC 
effects in three-dimensional (3D) conductors, namely the spin Hall 
effect of heavy metals*; however, the observed effects in two dimen- 
sions are considerably enhanced. Such spin-charge conversion phe- 
nomena have direct applications for spintronics technologies, which 
are based on the creation and detection of spin currents°. Given that the 
3D spin Hall effect is already used in spintronics devices®, the observed 
effects in two dimensions offer much promise for device applications 
(Fig. 1, top). 

The interplay between SOC and magnetism is of increasing impor- 
tance. In conventional magnetic materials, ferromagnetic order, which 
results from exchange interaction, aligns neighbouring spins. A well- 
known consequence of SOC is magneto-crystalline anisotropy—the pref- 
erential alignment of electron moments along certain crystallographic 
directions (‘easy axes’), via the coupling of electron motion to the crystal- 
line lattice field. In systems that lack inversion symmetry, SOC induces 


a chiral Dzyaloshinskii-Moriya interaction (DMI)”*, which takes the 
form: 


Hpm = — (S, x $2) - Dip (1) 


Here S, and S, are neighbouring spins and D), is the Dzyaloshinskii- 
Moriya vector. The DMI is a chiral interaction that decreases or increases 
the energy of the spins depending on whether the rotation from S$; to 
S, around Dj, is clockwise or anticlockwise. If S$, and S, are initially 
parallel, then the effect of a sufficiently strong DMI (with respect to 
exchange and anisotropy) is to introduce a tilt around Dj. DMI was 
initially understood as a super-exchange interaction in magnetic insu- 
lators”*, and later extended to non-centrosymmetric magnetic metals’. 
Ina disordered magnetic alloy, a large SOC element could mediate such 
an interaction between two nearby magnetic atoms, with the resulting 
Dzyaloshinskii-Moriya vector being perpendicular to the plane formed 
by the three atoms. Crucially, this model was extended to magnetic mul- 
tilayers, wherein inversion symmetry is broken by the presence of an 
interface!” (Fig. 1, bottom right). The existence of interfacial DMI was 
first demonstrated by the observation of spiral-like spatial modulations 
of the spin orientation with a winding periodicity related to the magni- 
tude of the DMI (ref. 11). DMI also enables the formation of other chiral 
spin structures—in particular, chiral domain walls and skyrmions—that 
are possibly relevant to next-generation information storage devices 
(Fig. 1, bottom). 

Recent developments in the techniques for thin-film growth and 
in the capabilities of ab initio calculations have enabled the synthesis 
of atomically flat surfaces and heterostructures, and the prediction 
of their electronic properties. A common thread across several such 
thin-film materials and heterostructures—heavy metal compounds 
and multilayers—is that the SOC strength at surfaces and interfaces is 
comparable to the other relevant energy scales, and so plays a pivotal 
part. In combination with surface and interface effects, this engenders 
fundamentally new spin-based phenomena that are robust to disorder 
and thermal fluctuations, with much promise for room-temperature 
spin-based applications. 

Here we describe these diverse low-dimensional spin-based phenom- 
ena in the context of their SOC origin. We begin by detailing the progress 
on spin-polarized states at the surfaces of topological insulators, Rashba 
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Figure 1 | Emergent phenomena from spin-orbit coupling (SOC) at 
surfaces and interfaces. A schematic illustration of the connection between 
the presence of strong SOC at material surfaces and interfaces (inner ellipse) 
and the resulting emergence of new interactions and electronic states 
(middle ellipse), such as Dzyaloshinskii-Moriya interaction (DMI; see 

Fig. 4a, e for details), Rashba interfaces (Fig. 2b, d) and topological surface 


interfaces and atomically thin (2D) materials, and examine their utility 
towards the generation and conversion of spin currents. Next, we describe 
the developments on interfacial-DMI-induced non-collinear spin textures 
(skyrmions and chiral domain walls) in magnetic films, and techniques 
to generate, stabilize and manipulate them in devices. Finally, we explore 
the feasibility of realizing the technological promise of these diverse SOC- 
induced surface and interface phenomena towards room-temperature 
device applications. 


Spin-polarized surface and interface states 

Rashba states 

The Rashba effect arises from SOC and broken inversion symmetry at 
material surfaces and interfaces!, with the corresponding Hamiltonian: 


Ha=voz-(k xo) (2) 


Here vo is the Rashba parameter, o is spin, k is momentum and Z is the 
unit normal to the surface or interface. The Rashba effect results in spin- 
split 2D dispersion surfaces and, importantly, in the locking of spin and 
momentum degrees of freedom to each other (Fig. 2b). 

Rashba SOC-split states have been investigated across various sur- 
faces and interfaces”!”!3, as shown for angle-resolved photoemission 
spectroscopy (ARPES) measurements of the Au(111) surface (Fig. 2d)"°. 
Interface alloying of heavy elements with intermediate-weight metals 
can enhance the in-plane potential gradient via hybridization, leading to 
more pronounced Rashba effects, as on the Bi/Ag(111) alloyed interface 
(vo=3eV A; ref. 14). 


Topological surface states 
In materials with heavy elements, strong SOC can split the p band by a 
large enough magnitude to flip the s—p band structure, inducing band 
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states (TSS; Fig. 2a, c). These emergent phenomona can in turn be used to 
generate new 2D spintronics effects (outer ellipse), such as spin-charge 
conversion (Fig. 2e, f and 3), the photogalvanic effect, enhanced SOC 

in 2D materials, such as graphene (Fig. 3d, e), magnetic skyrmions 

(Fig. 4b) and chiral domain walls (Fig. 4c), which have direct device 
applications (periphery). FM, ferromagnet; NM, non-magnetic material. 


inversion. Notably, 2D heterostructures of Hg,Cd,_,Te exhibit, in 
addition to such an inversion, an associated topological phase 
transition!>!°, which results in protected states at the edges of the sample. 
The presence of edge states in 2D heterostructures was subsequently 
generalized to 3D insulators'”!®. The surfaces or interfaces of such topo- 
logical insulators must host protected states at time-reversal-invariant 
k-space points!”!8, These topological surface states have a nearly linear 
energy-momentum relationship (Fig. 2a)'*. The Dirac Hamiltonian that 
describes these surface states, Hp x voZ-(k x a), has the same Rashba 
form (Equation (2)) and locks the spin and momentum degrees of 
freedom (Fig. 2a, c)!8. However, whereas Rashba SOC leads to spin-split 
parabolic surface states in conventional metals, topological surface states 
are distinguished by their helical single Dirac cone character, which 
emerges from the requirement to connect the bulk valence and 
conduction bands. 

ARPES measurements demonstrated the topological nature of surface 
states first in the indirect-bandgap semiconductor Bi,_,Sb, (ref. 19) and 
then in a larger, direct-bandgap (300 meV) topological insulator BizSe3 
(ref. 20). The discovery of a simple Dirac cone within the bandgap of bulk 
Bi,Se3 (Fig. 2c), with a chemical potential that is tunable via chemical 
doping” and the electric field effect”!, has since led to the discovery of 
several other single-Dirac-cone topological insulators’®. 

The electronic transport of topological insulators is governed by the 
helical Dirac nature of topological surface states. First, surface-state 
transport arises from a 2D Dirac cone: therefore, it can be ambipolar, 
controlled by electric fields, and tuned through the Dirac point with a 
characteristic minimum conductivity?!. Second, spin-momentum lock- 
ing prevents backscattering between states of opposite momenta with 
opposite spins, as evidenced across several topological insulators”. 
Because backscattering dominates charge dissipation in conventional 
metals, quasiparticles of topological insulators are expected to exhibit 
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Figure 2 | Band structure and spin-charge conversion in spin-polarized 
2D states. a, b, Schematic of the spin-polarized band structure (electron 
energy E as a function of in-plane momentum k) of 2D electron states 

at the surfaces and interfaces of topological insulators (a) and at 

Rashba systems (b). The arrows indicate electron spin, with blue and 

red dispersion surfaces corresponding to opposite spin helicities. 

c, d, ARPES measurements of the 2D band structure, with respect to the 
Fermi level (E =0), at the surface of the topological insulator Bi,_,Ca,Se3 
(c) and at the Rashba surface of Au(111) (d). The red arrows in c indicate 
the spin orientation, and the bulk bands are schematized in brown. e, k- 
space schematic of charge-to-spin conversion in topological insulators via 


longer lifetimes, enabling ballistic phenomena such as the quantum Hall 
effect”’, Third, the polarization of light incident on a topological insulator 
can couple to the surface-state momentum, thereby generating spin-po- 
larized photocurrents with high fidelity‘. Finally, magnetic doping of 
topological insulators breaks time-reversal symmetry, leading to a gap 
at the Dirac point”® and a reorientation of the low-energy spin texture”®. 
The mediation of magnetism by Dirac fermions in topological insulators 


enables exotic phenomena, such as the quantum anomalous Hall effect”. 


Conversion between spin and charge currents 

The helical spin polarization of Fermi contours of Rashba interfaces and 
surfaces of topological insulators enables the conversion between spin and 
charge currents by the Edelstein and inverse Edelstein effects* (Fig. 2e, f). 
Ina single-cone topological insulator, a charge current—that is, a shift of 
the Fermi contours in the direction of the electron motion (x) (Fig. 2e)— 
induces an overpopulation of spins in the transverse direction (y) as a 
result of spin-momentum locking, and is therefore associated with a non- 
zero spin accumulation. The spin accumulation can diffuse through an 
interface into an adjacent conducting material, resulting in a pure 3D spin 
current being injected into this material, without a net charge flow. The 
spin-charge conversion yield is quantified by the inverse length qics—the 
ratio between the resultant 3D spin current density and the applied 2D 
charge current density”*. Furthermore, if the spin current is injected into 
a magnetic material, then the resulting spin-transfer torque can be used 
to switch its magnetization. Alternatively, an a.c. charge current could be 
used to induce ferromagnetic resonance. 

In the inverse conversion of a spin current into a charge current by the 
inverse Edelstein effect on the interface of a topological insulator (Fig. 2f), 
the injection (extraction) of spins oriented along +-y (—y) into (from) the 
states of a helically spin-polarized Fermi contour populates (depopulates) 
states on the +x (—x) side of the Fermi contour. This out-of-equilibrium 
distribution corresponds to a charge current. The conversion coefficient— 
the ratio between the induced 2D charge current density and applied 
3D spin current density—is the inverse Edelstein effect length \jgr 


k, (nm) 


REVIEW 


Edelstein effect 


the Edelstein effect. A charge current J, at the surface causes a shift Ak of 
the Fermi contour, resulting in a non-zero spin density as a result of the 
helical spin orientation. This spin density can diffuse as a spin current in 
the adjacent material. f, Spin-to-charge conversion by the inverse Edelstein 
effect in topological insulators. Injecting a spin current (J,; spin-polarized 
wiggles) into the surface states of the topological insulator overpopulates 
states on one side of the Fermi contour and depopulates states on the 
other, generating a charge current. Panels a and b were adapted from 
figures 11.3(a) and 11.5(a), pages 181 and 183, of ref. 5, by permission of 
Oxford University Press. Panel c adapted from ref. 20, Nature Publishing 
Group. Panel d adapted from ref. 13, American Physical Society. 


(refs 29, 30). For a pure helical ground state \rgp= vp (ref. 31), where 
vp is the Fermi velocity and 7 is the relaxation time of an out-of- 
equilibrium distribution in the interface states. The spin-charge conver- 
sion by the Edelstein and inverse Edelstein effects at Rashba interfaces 
can be described in an analogous fashion, but accounting for the partial 
compensation of the two Fermi contours of opposite chirality, yielding 
AEE = VoT/h (refs 29, 30), where fi is the reduced Planck constant. 

Examples of topological insulators in which charge-to-spin conversion 
effects have been observed include (Bi;_,Sb,)2Te3 (ref. 28) (Fig. 3a). Here, 
the charge-to-spin conversion of the applied a.c. charge current generates 
a vertical spin current, and a spin-transfer torque on the magnetization 
of the deposited NiFe layer. When tuning the Fermi level across the Dirac 
point by varying the doping x, the conversion coefficient qics does not 
change sign because both charge and spin chiralities change simultane- 
ously (Fig. 3a). However, qics exhibits a sharp minimum at the Dirac point, 
at which a finite scattering rate between the upper and lower cone can mix 
the spin polarizations. Similar results of the generation of spin polarization 
and spin torques have been obtained for other topological insulators». 

Spin-to-charge conversion on topological insulators by the inverse 
Edelstein effect has been achieved both in spin pumping® and in tunnel- 
ling spin injection®® experiments. In the case of spin pumping conversion 
(Fig. 3b, c)*4, a vertical spin current J,, which is produced by ferromag- 
netic resonance from a ferromagnetic layer (NiFe), is injected into the 
topological insulator a-Sn through an intermediate Ag layer (Fig. 3c). 
The persistence of the Dirac cone on a-Sn after Ag deposition is veri- 
fied by ARPES (Fig. 3b), whereas direct deposition of Fe on Sn destroys 
the Dirac cone. A charge current induced by the inverse Edelstein effect 
is consistently found with Ag on a-Sn (Fig. 3c, blue), and not with Fe 
(Fig. 3c, green). Here, Azz =2.1 nm at room temperature—a spin-charge 
conversion efficiency that is at least an order of magnitude higher than 
that obtained with the inverse spin Hall effect in metals such as Pt or W 
(ref. 31). 

Analogous spin-charge conversions by the Edelstein and inverse 
Edelstein effects can also be obtained at Rashba interfaces””*”. However, 
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Figure 3 | Spin-charge conversion experiments. a, Charge-to-spin 
conversion by the Edelstein effect in spin torque ferromagnetic resonance 
(ST-FMR) experiments on the topological insulator (Bi, _,Sb,)2Te; at 

T= 10K. The graph shows the spin-charge conversion coefficient qics for 
several Sb concentrations x, and the corresponding Fermi levels in the 
Dirac cone. (Error bars represent the standard deviation over five samples 
with different dimensions.) Inset, schematic of the device, in which an 
applied longitudinal a.c. charge current J, (associated with an a.c. magnetic 
field H,-) is converted by the Edelstein effect into a vertical spin current J. 
This spin current is injected through a Cu layer into the top NiFe layer and 
detected by a ST-FMR-induced d.c. voltage, in an external d.c. field Hac 
applied at an angle 0. b, c, Spin-to-charge conversion by the inverse 
Edelstein effect in spin-pumping experiments on the topological insulator 
a-Sn. b, ARPES intensity maps (for varying electron energy E with respect 
to the Fermi level and in-plane momentum kj) show that the surface-state 
Dirac cone on «-Sn is preserved (symbols on the red lines represent 
maxima in ARPES intensity scans) even after Ag deposition (to a thickness 


owing to the compensation between the two Fermi contours of the Rashba 
interface, the conversion is generally not as efficient as for topological 
insulators. An exception is the interface between the insulating oxides 
SrTiO; and LaAlOs, for which inverse Edelstein effect values as large as 
6nm have been reported**. 


Materials directions and applications 
Despite tremendous excitement surrounding topological insulators, 
the development of technologically relevant materials has been hin- 
dered by several issues. In binary topological insulators (for example, 
Bi,Ses), intrinsic defects pin the chemical potential within the bulk bands, 
diminishing the contribution of the surface-state transport”. Although 
chemical doping could tune the chemical potential into the bandgap, this 
markedly reduces the mean free path”. Therefore, recent efforts have 
used epitaxial techniques to fabricate ternary topological insulators with 
better control over the chemical potential and mobility’. Other efforts 
towards functional topological insulators include systems with larger 
Rashba SOC and stack engineering of strong SOC materials*°. Another 
emerging direction lies at the crossover between Dirac and Rashba behav- 
iour, with the Dirac/Rashba character of quasiparticles tuned by film 
thickness (for example, few-layer Bi,Se; films*!) or chemical potential 
(for example, Sb; ref. 42). 

The partial freezing of backscattering for surface and interface states 
of topological insulators, which results in reduced energy dissipation 
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of 2.3nm). c, A d.c. voltage generated by the inverse Edelstein effect is 
observed when «-Sn is covered by Ag (blue), but not when it is directly 
covered by Fe, which destroys the Dirac cone (green). Inset, a FMR spin- 
pumping device (the magnetization M of the ferromagnetic layer in an 
external field B = ppHa., where jp is the permeability of free space, excited 
by an a.c. field) injects a vertical spin current J; through an Ag layer into 
the surface states of the topological insulator, which generates a charge 
current I,. d, Calculated band structure of graphene after Au intercalation 
between graphene and substrate, matched to ARPES measurements near 
the K point. Blue and red symbols indicate opposite spin orientations 
derived from fits to spin-resolved ARPES measurements. e, Spin-to-charge 
conversion by spin pumping from yttrium iron garnet (YIG) into 
graphene: experimental set-up (inset) and lateral voltage V induced by the 
inverse Edelstein effect for opposite applied fields. Panel a adapted from 
ref, 28, Nature Publishing Group. Panels b and ¢ adapted from ref. 31, 
American Physical Society. Panel d adapted from ref. 47, Nature 
Publishing Group. Panel e adapted from ref. 50, American Physical Society. 


by electrical currents, is a notable advantage for their use in low- 
power nano-devices. As a next step, the ambipolarity of topological 
surface states could be exploited to create, for example, topological 
p-n junctions* and spin transistors. The spin Hall effect of heavy 
metals is already used in spin-orbit torque magnetic random-access 
memory (SOT-MRAM) switching’, and similar conversions by topo- 
logical insulators or Rashba interfaces are expected to be much more 
energy-efficient™. 


Enhanced SOC in 2D materials 
As with surface and interface states of topological insulators, the 2D 
electronic states of single-layer graphene are characterized by linearly 
dispersing Dirac cones at the K and K’ points in reciprocal space“. 
In contrast to topological insulators, the SOC magnitude for the sp” 
bonded structure of pristine graphene is small (about 10 1eV)*4, and 
Dirac cones in graphene are generally supposed to be spin-degenerate. 
However, SOC can be greatly enhanced by proximity and hybridization 
with adjacent materials. In-plane and out-of-plane deformations that 
mix the sp” and sp’ orbitals in strained or buckled graphene, as well as 
interactions with adatoms and electric fields, can also lead to enhance- 
ments in SOC*-*”. 

The SOC effects of an ordered interface between graphene and 
another material on the energy dispersion of graphene can be expressed 
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with G=-+1 (—1) for the upper (lower) dispersion surfaces, a = +1 (—1) 
for the split upper (lower) dispersion surfaces, and where A is the SOC 
constant of the system and k = |k|. Therefore, SOC-induced splitting 
of graphene bands leads further to spin-momentum locking and 
chiral spin orientations; for example, the large SOC splitting (about 
100 meV) for graphene on Au (see, for example, Fig. 3d) is caused by the 
strong hybridization between the Dirac-cone states and d states of Au 
(ref. 47). 

Enhancement of SOC in graphene can similarly lead to 2D spin- 
charge conversion effects. For example, ferromagnetic-resonance spin 
pumping from yttrium iron garnet (YIG) into graphene induces a 
definite broadening and a lateral voltage characteristic of the inverse 
Edelstein effect (Fig. 3e)*° that is ascribed to the SOC induced by proxi- 
mity with YIG. Here, Aygz= 1073 nm, which is much smaller than 
with topological insulators or Rashba interfaces, indicating a mode- 
rate SOC splitting. On the other hand, intercalation of, for example, an 
Au layer may further enhance SOC (Fig. 3d), leading to more efficient 
conversion. Similarly, non-local voltages, corresponding to enhanced 
SOC, have been observed on graphene decorated with small amounts 
of adatoms or nanoparticles? ! and attributed to skew scattering on 
enhanced SOC perturbations”. 

Furthermore, it is appealing to explore other 2D systems that possess 
larger intrinsic SOC, such as layered transition-metal dichalcogenides 
(TMDs)*». Spin pumping and spin-transfer torque experiments on 
TMDs>>"4, notably MoS; and WS», have shown promise for spin-charge 
conversion. Whereas semiconducting TMDs have limited utility for spin- 
tronics, owing to their small mobility and large resistivity, heterostruc- 
tures of large-SOC TMDs and high-mobility graphene might be more 
useful. Considering the large SOC enhancement of graphene in proximity 
with WS, (ref. 54), larger spin-charge conversion effects can be expected 
in such structures. Heterostructures of the topological insulator Bi, Te2Se 
and graphene have also shown efficient transfer of spin current to the 
graphene layer. 


Interfacial spin interactions and chiral magnetism 
Interfacial DMI 

Magnetic materials that lack inversion symmetry can host the DMI in 
the presence of strong SOC. Consequently, neighbouring spins tilt with 
respect to each other, leading to spatial modulations of the spin orienta- 
tion. If the magnitude of Dj, (Equation (1)) is sufficiently large, then the 
competition between the ‘winding’ DMI and ‘aligning’ exchange inter- 
actions can give rise to non-collinear ground states''°*°8. Such chiral 
spin structures were initially identified in non-centrosymmetric single 
crystals°*°*", However, of increasing scientific interest and technological 
relevance, is their manifestation in films and multilayers with interfacial 
DMI (Fig. 4a). 

Interfaces between ultrathin magnetic materials and metals with 
strong SOC can host DMI, owing to broken inversion symmetry”. Large 
interfacial DMI (effective magnitude of D2 comparable to the exchange 
constant J) and ensuing spin textures were first observed in ultrathin 
epitaxial magnetic films: spin spirals in Mn on W(110) (ref. 11), and 
skyrmions on Fe and Fe/Pd on Ir(111) (refs 61, 62). In the presence of 
out-of-plane anisotropy (K), this also leads to collinear magnetic domains 
separated by Neel-type chiral domain walls, that is, those with the chirality 
of spin rotation through the domain wall determined by the orientation of 
Dy (Fig. 4c)-®*. More recently, it has been possible to demonstrate the 
presence of DMI-induced skyrmions at room temperature in magnetic 
multilayers grown by sputtering”, opening the possibility for using 
such structures in spintronics technologies”. 

The schematic in Fig. 4a (inset) shows such an interface between a 
magnetic film (Co) and a metal with strong SOC (Pt). Ab initio stud- 
ies have provided insights into the mechanism for interfacial DMI 
across several materials®!”!”2, For instance, it has been shown that, at a 
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Co/Pt interface, the DMI is strongest in the Co layer that is closest 
to the interface, and relatively negligible in other Co layers (Fig. 4a, 
bottom)”. Furthermore, the energy source of the large DMI in the inter- 
face Co layer is located predominantly in the neighbouring large-SOC Pt 
layer (Fig. 4a, top), adding credence to a direct correspondence between 
the DMI and interfacial electronic states. The DMI in such multilayers 
has been measured directly using spin-wave dispersion mapping 
techniques, and recent studies have shown its inverse relationship with 
the thickness of the magnetic layer—a clear signature of its interfacial 
nature”®. 


Magnetic skyrmions in thin films 

The large DMI observed in these magnetic multilayers is of particular 
interest because it induces new chiral spin textures, known as magnetic 
skyrmions. Skyrmions are quasi-2D spin textures wherein the out-of- 
plane magnetization is inverted at the centre and rotates smoothly with a 
fixed chirality across its width (Fig. 4b). Skyrmions are distinguished by 
the topological number S that characterizes the winding of the normalized 
local magnetization m(r) (at position r) in 2D systems: 


TT 
a 


i “ fm) -(Ogm(r) x Oym(r))dr = 4 


This topological number indicates that the skyrmion magnetization 
covers the entirety (47) of the unit sphere®. Even though skyrmion-like 
objects (such as magnetic bubbles) can be stabilized without DMI (for 
example, by dipolar interactions), they would not have a fixed topological 
number S, which is crucial to skyrmion properties’, In fact, the 
non-trivial topology of the skyrmion affords an energy barrier, protect- 
ing its spin structure: the spin configuration cannot be twisted continu- 
ously to obtain a different S (for example, that of uniform polarization). 
Another key property of the magnetic skyrmion is its solitonic nature 
with a finite extension: it can move as a particle for as long as it can be 
stabilized in a uniform ferromagnetic background. 

Skyrmions in ultrathin magnetic films were first observed in epitaxial 
magnetic monolayers on heavy metal substrates (Fe on Ir(111)), for which 
D,2/J can be extremely large (about 1)°!?. Here, skyrmions form a stable 
lattice configuration and the large value of Dj2/J results in spin rotation 
over shorter length scales, reducing the skyrmion size to just a few atomic 
lengths. However, the formation of skyrmions herein requires low 
temperatures (about 10 K). Isolated skyrmions can be stabilized in a 
metastable state by applying a magnetic field, for example in Fe/Pd on 
Ir(111) (Fig. 4d), or at zero field if the DMI is sufficiently large, yet 
smaller than the threshold value for stabilizing a skyrmion lattice or a spin 
spiral’*”, that is, if D <4./AK /1, where D is the normalized DMI per 
unit area of the film, A is the exchange stiffness and K characterizes the 
out-of-plane anisotropy. Such individual skyrmions can further be 
nucleated or annihilated (Fig. 5a). Owing to their topological protection, 
they are highly stable’®, but only in applied field and at low temperature, 
which limits their use as individual particles in devices. 

A crucial challenge for device applications is the room-temperature 
stabilization of such small individual skyrmions. The Néel or Curie 
temperatures of bulk materials that are known to host skyrmions are 
generally below room temperature. In addition, skyrmions in ultrathin 
magnetic monolayers (see, for example, Fig. 4d) are stable only at 
low temperatures. A prominent direction towards practical room- 
temperature skyrmions is the development of multilayers with additive 
interactions at successive interfaces®”’, First, the interfacial DMI can 
be enhanced by an appropriate choice of elements forming the multi- 
layer stack. For example, in the case of Ir/Co/Pt multilayers, the sign 
of D is opposite for Ir/Co and Pt/Co interfaces. As a result, when Ir 
and Pt layers are on opposite sides of the Co layer, their effects are 
additive, thus increasing the net DMI magnitude”. Furthermore, such 
trilayer stacks can be repeated (for example, the ten repeats of (Ir/Co/ 
Pt); Fig. 4e), and skyrmions in successive stacks are coupled through 
ultrathin non-magnetic layers, leading to columnar room-temperature 
skyrmions (>30 nm) stabilized by their larger magnetic volume”. 
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Figure 4 | Interfacial DMI and chiral spin textures. a, Anatomy of 
interfacial DMI from ab initio calculations. Bottom, Layer-resolved DMI 
in a Pt/Co bilayer. Top, distribution of SOC energies associated with the 
DM1 in the interfacial Co layer. Inset, a schematic of DMI at the interface 
between a ferromagnetic metal with out-of-plane magnetization (Co, grey) 
and a strong SOC metal (Pt, blue). The DMI vector Dj), associated with 
the triangle composed of two Co atoms and a Pt atom, is perpendicular to 
the plane of the triangle. S;,, neighbouring spins. b, c, Schematics of the 
spin configuration in interfacial-DMI-induced chiral spin textures such as 
magnetic skyrmions (b) and chiral Néel domain walls (c), with the colour 
scale corresponding to the out-of-plane magnetization component. d, 
SP-STM imaging of an individual skyrmion (with a diameter of 8 nm at a 
field of 3.25 T) in a Fe/Pd bilayer on Ir(111), acquired in constant-current 
topographic mode, with an in-plane magnetized tip, with the modelled 
magnetization overlaid (arrows). e, Skyrmion stabilization in multilayers, 


Room-temperature skyrmions have also been found in magnetic 
bilayers (Figs. 4f, 5b), although generally with a larger diameter®””-”. 
These efforts offer promising directions towards stack engineering of 
magnetic interactions to tune skyrmion properties in films for device 
applications®. 


Detection and manipulation of chiral spin textures 

Skyrmions in epitaxial films were first imaged using spin-polarized 
scanning tunnelling microscopy (SP-STM; Fig. 4d)°. Since then, they 
have been imaged in sputtered multilayer films using various magnetic 
microscopy techniques, including scanning transmission X-ray micros- 
copy (STXM; Fig. 4g)°”*8, photoemission electron microscopy (PEEM; 
Fig. 4f)’8, spin-polarized low-energy electron microscopy (SPLEEM)”, 
and magneto-optical Kerr effect (MOKE) microscopy (Fig. 5b)®. 
Importantly, skyrmions can also be detected using a variety of thermo- 
dynamic and transport techniques*!. In particular, the Berry phase that 
is accumulated by electrons traversing the 2D spin texture of skyrmions 
results in an additional component in anomalous Hall effect measure- 
ments, known as the topological Hall effect™*!, The Hall signal can be 
used to detect the presence of skyrmions and to address their motion 
in films and devices®!**. However, such Hall signatures of skyrmions 
have been detected thus far only in bulk crystal and films with intrinsic 
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illustrated using a multilayer stack of Ir/Co/Pt. The close-up of the trilayer 
shows DMI vectors (Dj and D3,) at the top (Co/Ir) and bottom (Pt/Co) 
interfaces of Co. The effective DMI magnitude is enhanced by the same 
direction of D)2 and D3, at the different interfaces. f, Room-temperature 
skyrmions in a Pt/Co/MgO multilayer in a lithographed 400 nm x 400 nm 
square, seen by XMCD-PEEM, with the magnetization profile along the 
red line shown below. g, Room-temperature skyrmions in (Ir/Co/Pt) x 10 
multilayers patterned into 300-nm-diameter disks (left) or 200-nm-wide 
tracks (right), seen by STXM. Panel a (main panel) adapted from ref. 72, 
American Physical Society. Panel a (inset) adapted from ref. 70, Nature 
Publishing Group. Panel d reproduced from ref. 62, American Association 
for the Advancement of Science. Panels e and g adapted from ref. 67, 
Nature Publishing Group. Panel f adapted from ref. 78, Nature Publishing 
Group. 


DMI*!-*; these techniques remain to be established in multilayer films 
with interfacial DMI. 

Magnetic skyrmions, owing to their small size and non-trivial topology, 
are attractive candidates for data storage in magnetic materials—provided 
that they can be nucleated, moved and read. Several nucleation techniques 
have been explored with micromagnetics simulations’***. In SP-STM 
experiments on Fe/Pd bilayers (Fig. 5a), individual skyrmions were nucle- 
ated and deleted using the current injected from the STM tip™. In other 
experiments, skyrmions have been created by applying field pulses®*. A 
remarkable result in this regard is the recent demonstration of “blowing of 
skyrmion bubbles”, generated by the current divergence out of a con- 
striction (Fig. 5b). In future, skyrmions should be able to be moved with 
notable ease compared with, for example, domain walls* by exploiting the 
SOT provided by the spin current”>*°*’, which emerges naturally from the 
spin Hall effect of the neighbouring heavy metal layers. The dynamic prop- 
erties of skyrmions have been explored using micromagnetics simulations 
and microscopy techniques in device configurations®. These works 
demonstrate that skyrmions can be manipulated with current and field 
pulses in lithographed geometric structures (Fig. 5b, c)°*—techniques 
that can be incorporated in memory devices with relative facility. 

These properties of magnetic skyrmions portend great potential 
towards realizing high-density and energy-efficient memory**”. Several 
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Figure 5 | Manipulation of magnetic skyrmions. a, Individual 
skyrmions (with diameters of 8 nm at a field of 3.25 T) in Fe/Pd/Ir(111) 
before and after SP-STM manipulation, demonstrating the creation and 
annihilation of individual skyrmions at specific locations. b, Skyrmions 
in a Ta/CoFeB/TaO, structure, before (top) and after (bottom) applying 
a current pulse through a constriction, with current-induced nucleation 
and subsequent motion of several skyrmions, as seen by magneto- 
optical Kerr effect (MOKE) microscopy. c, Experimental measurement 
of the current-induced skyrmion velocity in tracks of Pt/Co/Ta and Pt/ 


applications and architectures have been proposed and modelled, includ- 
ing skyrmion-based memory devices (Fig. 5d) analogous to domain-wall- 
based racetrack memory**. The interest in skyrmions with respect to 
domain walls is the smaller current that is needed for their displacement 
and the weaker influence of defects on skyrmion motion”®. More efficient 
SOT could also be obtained by using the spin-charge conversion at the 
interfaces of topological insulators instead of the spin Hall effect. 

DMI of intermediate strength has direct relevance to chiral domain 
walls®, which are also being actively explored®***°. The motion of mag- 
netic domain walls under SOT depends on the relative configuration 
of the domain-wall magnetization and the type of SOT under consid- 
eration®’. The large domain-wall velocity observed in perpendicular 
anisotropy films deposited on heavy metals was initially understood as 
emerging from the SOT at the interface between the magnet and the 
heavy metal”!. However, it has been recently understood that the marked 
enhancement in domain-wall velocity results from the stabilization of 
chiral, Néel-type domain walls by interfacial DMI?”. This DMI-induced 
stabilization suppresses the Walker breakdown mechanism that typi- 
cally limits domain-wall dynamics and explains the efficient SOT action 
on this type of domain wall®. In addition, the domain-wall chirality, 
which is determined by DMI, corresponds to a fixed direction of motion; 
hence, all chiral domain walls move in the same direction in a given 
stack structure. 

The motion of DMI-stabilized chiral domain walls can lead to new 
realizations of nanoscale data storage. For example, consider a memory 
element that stores the information by using the domain-wall position. 
The domain wall can have two stable positions (for example, using a notch 
along a short stripe). Here, the magnetization switch can be measured 
using a magnetic tunnel junction®’. Such memory architectures based on 
switching nanoscale spin structures require much less current than does 
conventional MRAM, wherein the magnetization of the entire device 
needs to be reversed. Furthermore, the fast SOT-induced motion of chiral 
domain walls? is relevant to advancing the development of other con- 
cepts, such as domain-wall racetrack memory*. 
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CoFeB/MgO (different symbols represent results from different devices) 
multilayers using STXM. (Error bars denote the standard deviation of 
multiple measurements.) d, Schematic of a skyrmion-based memory 
device in which skyrmions could be deleted, moved and written by 

the corresponding current j. Panel a adapted from ref. 62, American 
Association for the Advancement of Science. Panel b adapted from ref. 69, 
American Association for the Advancement of Science. Panel c reproduced 
from ref. 68, Nature Publishing Group. Panel d adapted from ref. 100, 
American Association for the Advancement of Science. 


Applications and outlook 

The interplay between SOC and inversion-symmetry breaking has given 
rise to fascinating phenomena at surfaces and interfaces, especially in the 
past decade. Topological insulators have been described as a “new state of 
quantum matter”!?, and the emergence of interfacial DMI has given rise to 
non-trivial spin structures. However, the remarkable properties induced by 
SOC at surfaces and interfaces go beyond chiral magnets and spin topology, 
including several types of 2D materials with intrinsic or engineered SOC. 
The emergent characteristics of these SOC-induced phenomena, which are 
robust at room temperature, offer several potential applications. 

First, spin-momentum locking in topological insulators can be 
exploited, via their interaction with normal metals, to obtain unprece- 
dented efficiency in spin-charge conversions. High spin-charge conver- 
sion efficiency will probably be harnessed in future spintronic devices, 
such as SOT-MRAM or nano-batteries. Second, the protection from 
backscattering in topological insulators can be used in low-dissipation 
devices. The topologically protected skyrmion spin configurations ulti- 
mately represent the smallest achievable size for an emergent non-volatile 
magnetic memory element in magnetic films, with immediate relevance 
to information storage. Skyrmions can be moved, created and annihilated 
in nanostructures, making them suitable for ‘abacus’-type applications 
such as racetrack memory. 

Although several of these avenues for application have emerged only 
in the past decade, the rapid advances along this front make us optimistic 
about the time frame in which we can reasonably expect to see devices 
that realize the potential of SOC-induced properties. Consider skyrmi- 
ons in magnetic multilayers: pioneering efforts on epitaxial films at low 
temperature)” were soon followed by room-temperature observations in 
the kind of sputtered multilayer films®”~°’S typically used in spintronics 
technologies. Demonstrations of their small size, electrical nucleation 
and motion—all under ambient conditions—offer further technological 
promise. Other devices include skyrmion-based transistors”, oscillators 
and microwave detectors”°. Furthermore, concepts of SOT-MRAM could 
be extended to utilize skyrmions. 
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The applications that could emerge from topological insulators may 
evolve over a range of time frames. Fundamental materials challenges 
remain to be overcome, including the integration of topological-insulator 
compounds into existing elements of spintronic technologies and the 
control of defects that impede their exotic properties at room tempera- 
ture. However, recently discovered topological insulators demonstrate 
unprecedented spin-charge conversion efficiency at room temperature”! 
They are therefore suitable candidates to replace spin-Hall-effect heavy 
metals—for example, for memory state switching of SOT-RAM—in the 
near future**. Another exciting prospect is the use of topological insu- 
lators to generate efficient SOT for manipulating skyrmions and chiral 
domain walls. Charge-to-spin conversion by Bi/Ag Rashba interfaces 
has been proposed as a way to develop non-volatile and complementary 
metal-oxide-semiconductor (CMOS)-compatible spin logic devices”®, 
and spin-to-charge conversion in ferromagnet-topological-insulator 
devices can be used for microwave-driven spin batteries’ and electri- 
cal power generators that exploit spin currents induced by temperature 
gradients”. Furthermore, the high fidelity of room-temperature spin- 
polarized photocurrents generated on topological insulators” offer 
promising opto-spintronics applications, including transparent con- 
ducting electrodes, phase modulators and polarizers. Finally, upon 
overcoming the materials challenges, we foresee that the ambipolarity 
of topological surface states could be used to create low-power spin- 
tronics devices such as a topological p-n junction’. One may envisage 
a device where an applied magnetic field turns such a p—n junction into 
an electronic Mach-Zender interferometer”, enabling a tunable junction 
transmission with spin filtering properties. 

The discovery of the novel states induced by SOC and inversion-sym- 
metry breaking at surfaces and interfaces opens up such a broad per- 
spective that the introduction of their topological properties will have a 
definitive and substantial effect on the technology of spintronics. 
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Developmental mechanisms of stripe 


patterns in rodents 
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Mammalian colour patterns are among the most recognizable characteristics found in nature and can have a profound 
impact on fitness. However, little is known about the mechanisms underlying the formation and subsequent evolution 
of these patterns. Here we show that, in the African striped mouse (Rhabdomys pumilio), periodic dorsal stripes result 
from underlying differences in melanocyte maturation, which give rise to spatial variation in hair colour. We identify the 
transcription factor ALX3 as a regulator of this process. In embryonic dorsal skin, patterned expression of Alx3 precedes 
pigment stripes and acts to directly repress Mitf, a master regulator of melanocyte differentiation, thereby giving rise to 
light-coloured hair. Moreover, Alx3 is upregulated in the light stripes of chipmunks, which have independently evolved 
a similar dorsal pattern. Our results show a previously undescribed mechanism for modulating spatial variation in hair 
colour and provide insights into how phenotypic novelty evolves. 


The underlying mechanisms of how repetitive morphologic structures 
develop and evolve remain unclear. Periodic colour patterns are a useful 
system for the investigation of this process because of their diversity, 
sophistication and visual accessibility, and because the cellular, deve- 
lopmental and molecular mechanisms that underlie spots and stripes in 
mammals remain largely unknown. Traditional model organisms, such 
as laboratory mice (Mus musculus), have been instrumental for iden- 
tifying genes that regulate pigment cell production, melanin synthesis 
and the pathways that alter the balance between two types of pigment: 
light pheomelanin and dark eumelanin!~*. However, it is unknown to 
what extent these pathways explain or even contribute to the diverse 
array of pigment patterns seen in wild mammals. Here we take advan- 
tage of the naturally occurring coat pattern of the African striped 
mouse, Rhabdomys pumilio (Muridae), to gain insights into the pro- 
cesses underlying the formation and evolution of mammalian stripes, 
a striking and characteristic pattern that has evolved independently in 
many taxa, including ungulates, carnivores, rodents, marsupials, lago- 
morphs and primates*®®, and is thought to confer a fitness advantage 
in a range of vertebrates””!°, 

Striped mice are diurnal, social rodents distributed throughout 
southwest Africa!!, whose dorsal coat is characterized by the presence 
of four dark and two light dorsal longitudinal stripes arranged in a 
dark-light-dark pattern (Fig. 1a). To understand how this stripe pattern 
is formed, we first characterized the distribution of hair and hair types 
across the body. In adult striped mice, hair can be classified into one of 
three phenotypic categories based on the individual pigment pattern— 
light, an unpigmented shaft with a eumelanic base; black, completely 
eumelanic from tip to base; and banded, a pheomelanic (yellow) shaft 
and eumelanic (black) base (Fig. 1b and Extended Data Fig. 1a). We 
quantified the proportion of each hair category in different regions 
along the dorsoventral axis (middle stripe, dark stripe, light stripe, flank 
and ventrum) and found that their proportions differed: the middle 
stripe and flank had similar proportions of all three hair types, but the 
light stripe contained mostly light hair, similar to what was found in 


the ventrum, whilst the dark stripe contained mostly black hair (Fig. 1b 
and Extended Data Fig. 1b). Thus, variation between light and dark 
stripes is largely determined by changes in the distribution of unpig- 
mented (light) and eumelanic (black) hair, rather than by differences in 
pigment-type switching (pheomelanin to eumelanin, and vice versa). 

To investigate how such spatial differences arise, we studied skin deve- 
lopment of striped mice between embryonic and early postnatal stages 
and observed stripe-like changes in both hair length and skin colour 
(Fig. 1c and Extended Data Fig. 2a—d). The hair length changes first 
became apparent at embryonic day 19 (E19; Extended Data Fig. 2a, b), 
but did not persist in adult mice (Extended Data Fig. 2e). By con- 
trast, the skin colour changes were first visible at E22, creating a stable 
arrangement and appearance that persists as the animal grows (Fig. Ic). 
At birth, both phenomena are present and are correlated with both 
morphology and gene expression (Extended Data Fig. 2c, d, f). Thus, 
the patterning mechanisms that underlie adult stripes begin during 
embryogenesis and can be visualized by changes in the organization 
of mesenchymal tissues. 

By postnatal day 2 (P2), light and dark stripes had similar levels of 
cell proliferation and hair follicle densities (Extended Data Fig. 3). 
However, histological sections of skin showed that hair follicles from 
the light stripe showed a large reduction in melanin deposition in the 
hair bulb relative to follicles from the dark stripe (Fig. 1d). Although 
hair follicles from both light and dark stripes contained pigment cells, 
as shown by immunohistochemistry for the pigment cell-marker KIT 
(Fig. le), those from light stripes exhibited less immunohistochemistry 
staining against microphthalmia-associated transcription factor 
(MITF) (Fig. 1f-h), a key transcription factor that promotes melanocyte 
differentiation and melanogenic gene expression'”. The extent of KIT 
staining at the base of hair follicles was indistinguishable between dark 
and light stripes (Fig. li), despite the large differences in MITF staining 
and visible pigmentation. We used quantitative PCR (qPCR) to measure 
the level of gene expression for key melanogenic genes and observed 
that the light stripe had lower expression of both tyrosinase (Tyr) 


1Howard Hughes Medical Institute, Departments of Organismic & Evolutionary Biology and Molecular & Cellular Biology, Museum of Comparative Zoology, Harvard University, Cambridge, 
Massachusetts 02138, USA. @HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA. ?Department of Genetics, Stanford University School of Medicine, Stanford, California 
94305, USA. “Instituto de Investigaciones Biomédicas Alberto Sols (CSIC/UAM) and Ciber de Diabetes y Enfermedades Metabdlicas Asociadas (Ciberdem), Madrid 28029, Spain. 5Center for 
Interdisciplinary Research in Biology, Collage de France, Paris 75005, France. °Université de Strasbourg, CNRS, IPHC UMR 7178, F-67000 Strasbourg, France. ’School of Animal, Plant and 
Environmental Sciences, University of the Witwatersrand, Johannesburg 2000, South Africa. 8Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA. 


518 | NATURE | VOL 539 | 24 NOVEMBER 2016 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Light Black Banded 
b Oo Vf o f G «¢ d 
f) oO 
21.0 8 
no 
= e 
= @ 
5 a 
i= 
8 
6 
a 
° 
o oO 
2 
SY 2 2 
ws 5 
i 
aia 
e 
250 tek 
it parma —_ oO 
2 a =2 105 3 3 150} Sa 
5 ; a Es tT 8% * 
rf Zz 0 ° 0 LL 
sees Dark Light Dark — Light 
LC g 505 NS Dark 2 eal I“ Tyr 
2 7. Se 40, tight | ot io 
= , x< i: i ar 
: a 2 oe 304 “7 
oo 0) Ss 2 
2 , # 5s 204 Dark 2 ede Tyrp1 
a ’ i=)) = 3 Ligh |" 
os a £2 104 = ight 7 abo I 
4 0 Dark 1 @ 
3 Pigment + 
— g Dark — Light 012345 


Figure 1 | Phenotypic characterization. a, Coat pattern of an African 
striped mouse. Inset, detail of the coat pattern. b, Proportion of each 
hair type in adults (n = 5; error bars represent s.e.m.). ¢, Dorsal pattern 
at embryonic day (E)22, postnatal day (P)0 and P2. d-f, Skin sections 
from P2 dark and light stripes stained with haematoxylin and eosin (d), 
and with immunohistochemistry antibodies for KIT (e) and MITF (f). 
Bright field images (e, f) depict pigment. Dotted lines (e, f) delineate the 
hair bulb. g, h, Number of detectable MITF* cells (g, dark versus light, 


and tyrosinase-related protein] (Tyrp1) (Fig. 1j). Taken together, these 
results suggest that the differences in the proportions of light and black 
hair in light and dark stripes, respectively, are explained largely by 
differences in levels of Mitf, extent of melanocyte differentiation and 
amount of melanogenic gene expression. 


Spatial differences in gene expression 

We used RNA sequencing (RNA-seq) to carry out an unbiased survey 
of transcriptome differences that correlate with stripe identity. Three 
different skin regions (light stripe, dark stripe and flank) from each of 
three individuals per stage were investigated at four stages (E19, E22, PO 
and P2; n = 12; 36 libraries). For E19, we used hair length as a proxy to 
mark and isolate incipient pigmentation stripes (Extended Data Fig. 2). 
Our initial analyses were carried out using, as a reference, either the 
M. musculus genome or a striped mouse transcriptome that we assem- 
bled and annotated in-house. The results from the two approaches 
exhibited considerable overlap but alignment to the de novo transcrip- 
tome reference captured a richer and more comprehensive differential 
signature (Extended Data Fig. 4a—c) and is described here. Of the more 
than 17,000 genes that we annotated in the striped mouse transcrip- 
tome, 1,062 exhibited significant differential expression between two 
regions (false discovery rate (FDR) < 0.1, using the negative binomial 
generalized linear model implemented in DESeq2, for which both stage 
and region are considered factors). The largest number of differentially 
expressed genes between regions was observed for the flank compared 
to the light or the dark stripe (Extended Data Fig. 4d-f), which prob- 
ably reflects a difference in body region or tissue composition rather 
than a colour-pattern-specific difference. Among 36 genes that were 


Relative mRNA levels 


two-tailed t-test; m = 60) and their fluorescence intensity (h, dark versus 
light, two-tailed t-test; n = 34). i, Extent of KIT stain (dark versus light, 
two-tailed t-test; n = 20, P=0.5429). In g-i, boxes represent the 25th to 
75th percentile, whiskers show the minimum and maximum value, and the 
horizontal line indicates the median. j, Quantitative PCR of melanogenic 
genes at P2 (ANOVA followed by a Tukey—Kramer test; n = 4 (individual 
dots); red lines depict the mean). Scale bars are 100|1m (d) or 501m (e, f). 
*P< 0.05; **P< 0.01; ***P< 0.001; NS, not significant. 


significantly upregulated (FDR < 0.1) in the dark stripe compared 
to the light stripe, there is an obvious signature of melanocyte pig- 
ment production (Tyr, Tyrp1, Mc1r, Oca2, Gpr143, Trpm1; Fig. 2a and 
Supplementary Table 1a). Several of these genes are direct targets of 
Mitf, but we did not observe a difference in Mitf mRNA levels between 
dark- and light-striped skin, probably because Mitfis also expressed in 
non-pigmentary skin cells outside the hair follicle. 

Among 28 genes that were significantly upregulated (FDR < 0.1) 
in the light compared to the dark stripe, a clear functional signa- 
ture did not emerge; however, our attention was drawn to Alx3, 
which encodes a paired-class aristaless-like homeoprotein that has 
previously been implicated in fate specification of mesenchymal 
tissues!>-!©, As described below, the temporal and spatial expression 
of Alx3 make it a strong candidate for regulating colour differences 
among stripes. Furthermore, Alx3 showed the highest fold differ- 
ences in transcript abundance in the light stripe relative to dark stripe 
and the flank (6.73 and 4.32-fold, FDR 1.16 x 107!” and 4.09 x 10713, 
respectively; Fig. 2a, Extended Data Fig. 4e and Supplementary 
Table la, b). 

To further investigate a potential role of Alx3 in stripe patterning, we 
examined its spatial and temporal expression during skin development 
and stripe formation. Quantitative RT-PCR showed that Alx3 mRNA 
was elevated in the incipient light compared to dark stripes at E19 and 
that the difference increased at E22, after which it remained but was 
reduced at PO (Fig. 2b). This pattern contrasts with that observed for 
other pigmentation-related genes including Asip, Edn3, Tyr and Tyrp1 
(Extended Data Fig. 5), for which differential expression is not apparent 
until E22 or later. 
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Figure 2 | Alx3 is a candidate for regulating spatial differences in hair 
colour. a, Differential expression of transcripts in light versus dark 
stripes (n = 145; FDR < 0.1); higher expression in light (yellow) or 

dark stripe (blue). Known pigmentation genes are labelled, those with 
differential expression (n = 9; dark yellow, dark blue) and without 
differential expression (1 = 8; pink). b, Quantitative PCR of Alx3 at three 
stages (ANOVA followed by a Tukey—-Kramer test; n= 4; ***P < 0.001. 
Red lines depict the mean). c, d, In situ hybridization shows Alx3 


Alx3 expression during development 

Using in situ hybridization, we examined the expression of Alx3 during 
the early stages of skin development. Previous work on Alx3 during 
embryogenesis (E10.5-E12.5) in M. musculus showed expression in 
neural-crest-derived mesenchyme and in lateral plate mesoderm". In 
sections from E13.5 and E15.5 M. musculus embryos, we confirmed this 
pattern of expression, observing Alx3 mRNA in lateral mesenchyme 
at E13.5, extending to and predominating in ventral mesenchyme by 
E15.5 (Fig. 2c). A similar pattern is present in striped mice (Fig. 2d), but 
striped mouse embryos show an additional domain of Alx3 expression 
in the developing dorsal skin (red arrowheads in Fig. 2d), which corre- 
sponds anatomically with the future position of the stripe domain. At 
these early (E15) stages, ALX3 is primarily expressed in epithelial cells, 
as indicated by combined immunohistochemistry with E-cadherin, an 
epidermal compartment marker (Fig. 2e). At PO, when Alx3 mRNA 
remains elevated in light compared to dark stripe skin, immunostaining 
for ALX3 was found in cells in developing dorsal hair bulbs (Fig. 2f). 
Combined immunostaining with antisera for S100 (Fig. 2g), a marker 
for neural crest-derived cells!”, shows that hair bulb expression of ALX3 
includes melanocytes as well as keratinocytes (Fig. 2h). These results 
demonstrate that establishment of a cellular compartment correspond- 
ing to the stripe domain occurs early during skin development, indicate 
that Alx3 is a participating and/or responding factor in pattern estab- 
lishment, and suggest multiple pathways—a direct effect on pigment 
cells and/or an indirect effect on hair bulb keratinocytes—through 
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which alterations in melanocyte Mitf expression and pattern imple- 
mentation may take place. 


Alx3 has a melanocyte autonomous effect 
To further investigate the potential relationship between Alx3, Mitfand 
pigment production, we carried out both gain- and loss-of-function 
experiments in cultured cells. For gain-of-function experiments, a 
lentiviral construct in which Alx3 and a GFP reporter were driven by 
a PGK promoter (LV-A/x3:GFP; Extended Data Fig. 6a) was trans- 
duced into B16-F1 cells, an M. musculus melanocyte cell line that 
expresses Alx3 (Extended Data Fig. 6b); control cells were transduced 
with the same construct which lacked Alx3 (LV-GFP)!® (Extended 
Data Fig. 6a). We found that cells carrying the full length Alx3 
construct (LV—Alx3:GFP) exhibited a marked decrease in mRNA 
levels of Mitf and Silver, a key melanogenesis gene and Mitf target 
(Extended Data Fig. 6c). For loss-of-function experiments, we 
used short hairpin RNAs (shRNAs) against Alx3. Three of the four 
shRNA lentiviral constructs tested (shRNA1I, 2 and 3) caused Alx3 
mRNA levels to decrease relative to cells containing a scrambled 
shRNA control (Extended Data Fig. 6d). The same three constructs 
caused Mitf and Silver mRNA levels to increase. Thus, these experi- 
ments indicate that Alx3 can negatively regulate Mitf in a cell- 
autonomous fashion. 

To investigate a potential non-cell-autonomous effect of Alx3 on Mitf 
expression, we transduced primary M. musculus keratinocytes with 
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Figure 3 | Alx3 decreases melanin synthesis in vivo. a, Ultrasound- 
assisted injection of a GFP-tagged lentivirus results in high-efficiency 
stable transduction of the skin, as shown by a M. musculus embryo injected 
at E8.5 and analysed at E18.5. b, A sagittal section of a hair follicle from 

a lentivirus-infected M. musculus embryo shows transduction (GFP* cells) 
of melanocytes (GFP* KIT* cells indicated by arrowheads). c—h, Hair 
follicles from samples injected with LV-GFP control (c-e) and LV-Alx3: 
GFP (f-h) depicting immunohistochemistry for MITF (c, f), virus- 
transduced cells (d, g), and merged images showing MITF* GFP* cells 

(e, h) (arrowheads). Dotted lines (b—-h) delineate the hair bulb. i, Number 
of detectable MITF* cells (LV-Alx3:GFP versus LV-GEP; two-tailed 

t-test; n = 60, ***P < 0.001). Boxes represent the 25th to 75th percentile, 
whiskers show the minimum and maximum value, and the horizontal line 
indicates the median. j, Quantitative PCR of mRNA levels of melanogenic 
genes (LV-GFP versus LV—Alx3:GFP; two-tailed t-test; n = 3 (individual 
dots); red lines depict the mean; ***P < 0.001). 


LV-Alx3:GFP or LV-GFP and co-cultured them with wild-type B16-F1 
melanocytes using a cell-culture-insert system (Extended Data Fig. 7a). 
Melanocytes exposed to keratinocytes stably expressing LV-Alx3:GFP 
did not differ in their Mitf, Tyr or Silver mRNA levels, relative to those 
co-cultured with LV-GFP expressing keratinocytes (Extended Data 
Fig. 7b). In addition, we obtained a similar response when wild-type 
melanocytes were co-cultured with LV-Alx3:GFP transduced melanocytes 
(Extended Data Fig. 7c, d). These experiments show a reciprocal relation- 
ship between Alx3 and Mitf, in which expression of Mitf mRNA and its 
melanogenic gene targets can be either up- or downregulated in response 
to inhibition or overexpression, respectively, of Alx3 mRNA. Our results 
further indicate that regulation of Mitf expression and melanogenic gene 
expression by Alx3 occurs via a melanocyte-autonomous process. 


Alx3 decreases melanogenesis in vivo 

We next examined the effect of Alx3 in vivo using the same lentiviral 
constructs described above and ultrasound-assisted in utero injections 
into E8.5 mouse embryos. At this stage, before neural tube closure, 
injected lentivirus infects skin epidermis and cells originating from the 
neural crest, including melanocyte precursors (Fig. 3a, b). We observed 
GFP expression in cells in the centre of the hair follicle as well as the 
upper periphery; co-staining with KIT confirmed that many or most 
of the GFP-positive cells in the centre of the follicle were melanocytes 
(Fig. 3b). 

Hair follicles from mice sampled at P4, a stage during which melanin 
synthesis is active’, that were transduced with LV—Alx3:GEP had a 
marked reduction in the number of MITF* GFP* cells compared to the 
control, as revealed by immunohistochemistry (Fig. 3c-i). In addition, 
LV-Alx3:GFP-transduced melanocytes, isolated by fluorescence- 
activated cell sorting (FACS), showed a decrease in mRNA levels of 
melanin synthesis markers (Tyrp1, Tyr and Silver) relative to control 
cells (Fig. 3)). To determine whether the effect of Alx3 overexpression 
reduced the number of melanocytes or just their expression of Mitf, 
we performed immunohistochemistry against SOX10, a melanocyte 


ARTICLE 


a b Mitt Mitt Mitt 
Mitf promoter Site3  Site5 Site10 Tyr 
123 45 6789 10 ee 15 . 
-- — —@—9—00—9-® -- z fal an fal Ns, 
£10 ° 
Mammal conservation s 
5 5) on 
nol aaa fo) 
ro) ° 9 2S 
om am ° cc) oo 3) aE OP 
0—— — — —- 
S > S49 G1 © 
EP LH LN SS 
Ss y <4 nd 
s s s s 


Figure 4 | Alx3 binds to the Mitf promoter directly. a, Top, approximate 
locations of putative Alx3 binding sites (labelled 1-10), conserved in 

M. musculus and striped mouse (red circles) and across mammals (orange 
circles), along the 1.5-kb region of the Mitf M promoter. ATG signals the 
start of the coding sequence. Bottom, plot of evolutionary conservation 

for the 1.5-kb region of the Mitf M promoter in placental mammals, 
identified in silico (http://genome.ucsc.edu) with sites predicted to be 
conserved (blue bars) and those less conserved (brown bars). b, Chromatin 
immunoprecipitation-quantitative PCR (ChIP-qPCR) assays showing 
amplification of Mitf chromatin corresponding to different regions of the 
promoter immunoprecipitated with an anti-ALX3 antibody or with control 
non-immune rabbit serum (NRS) (anti- ALX3 versus NRS; two-tailed 
t-test; n= 4 (individual dots); red lines depict the mean; *P < 0.05; 

NS, not significant). 


marker upstream of Mit, 219-21 and found that the number of SOX10+ 
GFP* cells in LV-Alx3:GFP and control samples did not differ 
(Extended Data Fig. 8a-g). Furthermore, we did not detect a difference 
in the number of keratinocytes, as determined by counts of K14+ GFP* 
cells inside hair follicles, or in hair follicle density (Extended Data 
Fig. 8h-o). The ability of elevated levels of Alx3 to suppress expression 
of Mitf and melanocyte differentiation in experiments with M. musculus 
recapitulates the differences seen between the light and dark stripes 
of striped mice. 


Alx3 directly represses Mitf 

The histological and genomic data point to several pathways that 
contribute to stripe differences, in which an Alx3-Mitf interaction 
may play an early and predominant role. We next used in vivo and 
in vitro protein-DNA interaction experiments to gain additional 
insight into how Alx3 downregulates Mitf. Alx3 can selectively bind 
to a DNA-consensus sequence containing a TAAT motif”. The Mitf 
gene contains nine distinct promoters, of which one, 1M, is selec- 
tive to melanocytes and prominently active in these cells!*. In an 
approximately 1.5-kb region upstream of the M. musculus Mitf M 
transcriptional initiation site, including regions known to be relevant 
for transcription factor binding’, we identified ten candidate TAAT 
binding sites conserved between striped mice and M. musculus, four 
of which are also conserved across mammals (labelled 1-10; Fig. 4a 
and Extended Data Fig. 9). Electrophoretic mobility shift assays 
(EMSA) with nuclear extracts of B16-F1 cells showed that binding 
sites 3, 5 and 10 generated sequence-specific DNA-protein complexes 
(Extended Data Fig. 10a, b). The addition of ALX3 antiserum”* dis- 
rupted the formation of these complexes, demonstrating that ALX3 
binds to the oligonucleotide probes for these three sites (Extended 
Data Fig. 10a). 

We then carried out chromatin immunoprecipitation (ChIP) assays 
on B16-F1 cells and found that Mitf promoter sequences that contain 
candidate binding sites 3, 5 and 10 were selectively amplified by qPCR 
from chromatin immunoprecipitated with the anti- ALX3 antiserum, 
but not with control non-immune serum (Fig. 4b). Promoter sequences 
from the Tyr gene, used as a control, were not amplified from the anti- 
ALX3 immunoprecipitated chromatin (Fig. 4b). Thus, ALX3 binds 
to three of the predicted binding sites immediately upstream of the 
melanocyte Mitf promoter both in vitro and in vivo. 

We also tested the effect of Alx3 overexpression on Mitf promoter 
activity in B16-F1 cells. Using a luciferase reporter cloned downstream 
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Figure 5 | Hair colour patterning mechanisms in rodents. a, Chipmunks 
independently evolved a dorsal pattern that resembles that of striped 
mice. Photo: J. Sartore, National Geographic Photo Ark. b, Quantitative 
PCR of Alx3, Asip and Edn3 mRNA levels along the dorsoventral axis 

in chipmunks. Differences evaluated by ANOVA followed by a Tukey- 
Kramer test; 1 = 4 (individual dots); *P < 0.05; **P < 0.01; ***P< 0.001. 
Red lines depict the mean. c, Combination of a melanocyte autonomous 
pathway, mediated by Alx3, and a non-autonomous pathway, modulated 
by paracrine factors (Edn3 and Asip), may explain variation in rodent 
pigmentation patterns, specifically, and in mammals, generally. 


of Mitf promoter sequences, we found that overexpression of Alx3 
caused a decrease in luciferase expression relative to GFP overexpres- 
sion (Extended Data Fig. 10c). Furthermore, when we mutated the 
TAAT motifs relevant for ALX3 interactions as identified by our EMSA 
experiments (that is, sites 3, 5 and 10), luciferase activity between GFP- 
and Alx3-expressing cells did not differ, indicating that Alx3 could no 
longer suppress expression (Extended Data Fig. 10c). As a control, 
we mutated candidate binding site 1, which did not show binding in 
our EMSA experiments and found that Alx3-transfected cells had a 
decrease in luciferase activity relative to GFP-transfected cells, similar 
to what was seen with the wild-type Mitf promoter (Extended Data 
Fig. 10c). Together these results provide strong evidence that Alx3 
represses Mitf in melanocytic cells by binding directly to specific 
sequences in the Mitf promoter. 


Convergent evolution of striped rodents 

Colour patterns in striped mice are very similar to, but evolved inde- 
pendently from, those seen in Eastern chipmunks (Tamias striatus), 
a sciurid rodent that shares a last common ancestor with murids 
about 70 million years ago (refs 25-27) (Fig. 5a). In skin biopsies 
from adult Eastern chipmunks, we measured expression of Alx3 
using qPCR and found that mRNA levels were higher in the light 
stripe than in the dark stripe and flank (Fig. 5b). We also measured 
expression of Asip and Edn3, which encode paracrine regulators of 
pigment-type switching, and compared the results in striped mice 
(Extended Data Fig. 6a, b) to those in chipmunks (Fig. 5b). Overall, 
Asip mRNA levels were elevated in light areas (the light stripe, flank 
and ventrum in striped mice and the flank and the ventrum in chip- 
munks), whereas Edn3 mRNA levels were elevated in dark areas (the 
dark stripes in both species). Thus, Alx3-mediated repression of Mitf 
in melanocytes and paracrine regulation of pigment-type switching 


522 | NATURE | VOL 539 | 24 NOVEMBER 2016 


are common developmental mechanisms that originated repeatedly 
during mammalian evolution to generate stripe patterns. 


Discussion 

In non-mammalian vertebrates, mechanisms that underlie colour 
patterns depend on specialized types of pigment cells that become 
organized into specific arrangements during development*”*°. But 
in mammals, only a single type of pigment cell exists, the melanocyte, 
and, in general, localized differences in colour are generated by het- 
erogeneity in gene expression rather than heterogeneity in cellular 
distribution! ". For stripes and spots in felids, pigment-type switch- 
ing controlled by Asip and Edn3 is the major determinant of colour 
pattern’, but for dorsal stripes in striped mice, suppression of 
melanocyte development is a main driver of colour pattern. Indeed, 
Alx3-induced inhibition of Mitf expression without loss of melanocytes 
represents a previously unrecognized evolutionary tool for the local 
modulation of colour pattern. 

In striped mice and chipmunks, both Alx3-mediated repression of 
Mitf and pigment-type switching modulate hair colour. For example, 
increased expression of both Asip and Alx3 distinguish the ventrum 
from the flank and increased expression of Edn3 distinguishes the 
dark stripes from surrounding regions. This interplay between 
melanocyte-autonomous (Alx3-dependent) and non-autonomous (par- 
acrine regulation of pigment-type switching) pathways may help explain 
patterns composed of multiple intensities of pigment (for example, 
pattern elements that are almost devoid of pigment (Alx3-mediated 
repression of Mitf) mixed with elements that are yellow or orange 
(pigment-type switching)) (Fig. 5c). Future efforts, directed towards 
developing gene-editing approaches in striped mice and chipmunks, 
will dissect the relative contribution of each pathway to the resulting 
phenotype. 

From an evolutionary perspective, we show that laboratory mice, 
striped mice and chipmunks all express Alx3 in ventral skin, but only 
the latter two species upregulate Alx3 in their dorsum. While Alx3 and 
Asip*? contribute to pale-coloured ventrums, we hypothesize that Alx3 
was subsequently co-opted and expressed in an additional dorsal domain, 
ultimately giving rise to light-coloured stripes. It is not yet clear whether 
the developmental cues that direct Alx3 expression and stripe formation 
in striped mice and chipmunks originate from an organizing centre, such 
as the neuroectoderm, or arise spontaneously (for example, through a 
reaction-diffusion mechanism), but it is possible that the same princi- 
ples operate broadly across mammalian taxa. Finally, although striped 
mice and chipmunks share a similar dark-light-dark stripe pattern, the 
location of the stripes along the dorsoventral axis differs considerably, 
with stripes in chipmunks situated more laterally than in striped mice. 
In addition, stripe number varies widely among rodents, from the single- 
striped grass mouse (Lemniscomys rosalia) to the thirteen-lined ground 
squirrel (Ictidomys tridecemlineatus). From this perspective, a detailed 
understanding of the spatial regulation of Alx3 in rodents, and other 
striped mammals, could lead to potential insights into the evolution of 
gene regulation and the developmental basis of evolutionary novelty. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Striped mouse breeding colony. F10 descendants of wild-derived striped mice 
(R. pumilio, originating from Goegap Nature Reserve, South Africa, $ 29° 41.56’, 
E 18° 1.60’) were obtained from a captive colony at the University of Zurich 
(Switzerland) and are now maintained at Harvard University. They are kept at a 
16:8 light-dark cycle and given food ad libitum. Developmental stages were inferred 
from morphological similarities with M. musculus embryos. Harvard University’s 
IACUC committee approved all experiments. 

Phenotypic characterization. Adults. We identified three main hair types based 
on their individual pigment pattern: black, banded and light. To characterize the 
pigment pattern along the dorsoventral axis, we quantified the proportion of each 
of these hair types in 1 mm hair plugs taken from each dorsal stripe, the flank and 
the ventrum of five adult mice. In addition, we scored the number of guard, awl and 
zigzag hair found in each region. To determine hair length, we placed hairs from 
the hair plugs on microscope slides, mounted them with glycerol and measured 
their length using Axiovision Microscopy Software (Zeiss). 

Embryos and pups. We fixed embryos with 4% paraformaldehyde, dissected the 
skin and mounted it on glass slides (dermal side up). For estimating hair follicle 
density in pups, we detached a portion of each stripe from the muscle, while leaving 
the ends attached, embedded samples in OCT (Fisher Scientific), cryosectioned 
them transversally and stained them with haematoxylin and eosin. This technique 
allowed us to count individual hair follicles and assign them to the specific region to 
which they belonged (light stripes, dark stripes or flank). We counted the number 
of hair follicles and estimated the surface area of the tissue using Image]*°. Since 
our phenotypic characterization (Fig. 1b and Extended Data Figs 1b, 3b) and gene 
expression patterns determined by qPCR (Figs 1j and 2b) showed no differences 
between the two dark stripes, we carried out most of our analyses with dark stripe 1 
(closest to the midline). We manually quantified the number of MITF* cells per 
hair follicle, as detected with the antibody. For quantification of MITF fluorescence, 
we obtained images from hair follicles in the light and dark stripes, outlined stained 
cells, and measured the integrated density using ImageJ*°. To obtain the corrected 
total cell fluorescence (CTCF), we multiplied the area of each selected cell by the 
mean fluorescence of the background readings and subtracted that value from 
the integrated density of stained cells**. To quantify the extent of KIT staining, 
we obtained images from hair follicles in the light and dark stripes, delineated 
the hair bulb area and measured the proportion of the area that was stained with 
KIT* using ImageJ. Data were obtained from three pups and three embryos. All 
counts were done blind and samples were randomized. Statistical differences were 
determined using two-tailed t-tests or ANOVA (sample sizes and statistical tests 
used are indicated in figure legends). 

Cell proliferation. We injected the peritoneum of striped mouse pregnant females 
with 101g g! of EdU (5-ethynyl-2'-deoxyuridine) (ThermoFisher Scientific) two 
days before birth and collected pups at P2. To measure proliferating cells, we micro- 
dissected dark and light stripes, embedded them in OCT for posterior cryosectioning 
and used the Click-iT EdU Imaging kit (ThermoFisher Scientific), following the man- 
ufacturers’ protocol. We counted proliferating cells in the epidermis and hair follicles 
from pictures of light and dark stripes. Data were obtained from three individuals. All 
counts were done blind and samples were randomized. Statistical differences were 
established using two-tailed t-tests (sample sizes are provided in the figure legend). 

Immunohistochemistry. Striped mouse embryos were fixed in 4% paraformal- 
dehyde, embedded in OCT/sucrose, and sectioned using a cryostat (CM 3050S, 
Leica). We performed immunohistochemistry using anti-MITF (Abcam 80651; 
1:100), anti-ALX3 (Abcam 64985; 1:500), anti-KIT (DAKO A4502; 1:1,000), anti- 
E-cadherin (Millipore ECCD-2; 1:200), anti-S100 (Abcam 4066; 1:200), and anti- 
SOX10 (Abcam 27655; 1:100). We visualized reactions with Alexa-dye-conjugated 
secondary antibodies (Molecular Probes) at 1:500 dilution in 3% bovine serum 
albumin (BSA), PBS and Tween or with biotinylated goat anti-rabbit (Jackson 
Labs) and tyramide signal amplification (Perkin Elmer). For controls, we incubated 
sections with PBS instead of primary antibodies, but no specific cellular staining 
was observed. Cell nuclei were stained with DAPI (Southern Biotech). We exam- 
ined sections using a LSM 700 confocal microscope and an A1 Imager (Zeiss). All 
pictures are representative of at least three individuals. 

Quantitative PCR (qPCR). We separated the skin from the muscle and microdissected 
skin tissue corresponding to different regions (that is, dark stripe 1, light stripe, 
dark stripe 2, flank and ventrum) at the different time points indicated throughout 
the text. We then extracted total RNA using the fibrous tissue RNeasy kit (Qiagen), 
which included a DNase on-column treatment. Using qScript CDNA SuperMix 
(Quanta BioSciences), we generated complementary DNA (cDNA) and then per- 
formed qPCR using PerfeCTa SYBR Green FastMix (Quanta BioSciences). We used 
40 cycles of amplification on an Eppendorf Mastercycler. For analysis of striped mice, 
we designed primers along sites that were conserved across mice and rats. For chip- 
munk samples, we designed primers along sites that were conserved across mice and 
thirteen-lined ground squirrels (all primers sequences are listed in Supplementary 


Table 2). For measurements of Mitf expression in M. musculus samples, we used 
validated qPCR primers from the PrimerBank database*”. We assayed gene expression 
in triplicate for each sample and normalized the data using the housekeeping gene 
6-actin. Samples used for qPCR correspond to different individuals than those used 
for RNA-seq analysis. We analysed data from all qPCR experiments using the com- 
parative C, method*®, and established statistical significance of expression differences 
using either ANOVA followed by a Tukey-Kramer test or two-tailed t-tests (sample 
sizes and specific statistical tests used are given in each figure legend). 

RNA sequencing. For each of the time points described in the text, we dissected 
skin tissue (dark stripe 1, light stripe, flank) and extracted RNA as indicated for 
qPCR. We used RNA from three different regions (light stripe, dark stripe and 
flank) from each of three individuals for four different stages (E19, E22, PO, P2; 
n= 12; 36 libraries in total). We prepared cDNA libraries for each sample using 
Illumina’s TruSeq RNA Library Preparation Kit v2. We multiplexed individual 
libraries (six per lane) and sequenced them as paired-end 50-bp reads on an 
Illumina HiSeq 2000 instrument at the Genome Sequencing Laboratory of the 
HudsonAlpha Institute. We used Cutadapt software (version 1.8.1) to trim RNA- 
seq reads for residual adaptors and low quality sequences. Since a good quality 
reference genome is not currently available for the striped mouse, we used a dual 
exploratory strategy to assess differential expression between skin regions across 
various developmental stages. First, we aligned the trimmed RNA-seq reads against 
the M. musculus reference genome version using genomic sequence and transcript 
annotations obtained from Ensembl (release 80) and the STAR aligner software 
(version 2.5.0b). In parallel, we assembled trimmed RNA-seq reads into a de novo 
transcriptome using the Trinity suite of tools (version 2.1.0). The resulting de novo 
transcriptome assembly was subsequently annotated using an in-house annotation 
procedure supported by the human reference exon sequences retrieved from the 
Ref-Seq sequence database (GRCh37/hg19, release 55; http://www.ncbi.nlm.nih. 
gov/refseq/). Briefly, to associate a specific gene entity to each de novo assembled 
transcriptomic contig, we mapped our de novo assembly against a comprehensive 
database of reference human exon sequences using the Blast software (version 
2.2.22). We retained only alignments with a significant Blast Expected Value < 10“ 
for subsequent annotation purposes. Based on these alignments, we subsequently 
computed an ad hoc mapping score for each pair {assembly contig, gene entity} for 
which at least one significant exon alignment was identified. The mapping score 
was computed as the sum of the highest Blast alignment bit-scores at each posi- 
tion within a particular contig, associated with at least one significant alignment 
against an exon of the considered gene entity. Ultimately, the annotation procedure 
associated to each mapped contig the gene entity with the highest ad hoc mapping 
score. We then used the annotated de novo transcriptome assembly as reference 
for aligning trimmed RNA-seq reads using the Bowtie2 software (version 2.2.5). 

We computed gene counts from read alignments, obtained using either the 
M. musculus reference or the de novo transcriptome assembly, with three software 
tools included in the Trinity suite: eXpress (version 1.5.1), Kallisto (0.42.4) and REEM 
(version 1.2.23). We then used the individual sets of gene counts computed for each 
transcriptome reference and each abundance estimation tool to test for differential 
gene expression between samples from various skin regions with the DESeq2 package 
(version 1.10.1) from Bioconductor. The entire dataset (3 individuals, 3 regions, 
4 stages; n= 36 libraries) was analysed under the DESeq2 negative binomial generalized 
linear model, which is a powerful and robust approach for identifying genes that are 
differentially expressed, either between stages or between regions. In all analyses, we 
used a FDR < 0.1 as a statistical significance threshold. Results presented here depict 
region-specific two-way comparisons across all four stages, dark versus light (Fig. 2a), 
light versus flank (Extended Data Fig. 4e) and dark versus flank (Extended Data 
Fig. 4f), in which genes assessed as significant represent the intersection between the 
three abundance estimation approaches implemented in the Trinity suite. 

There are major changes in cell composition and skin development across the 
four stages we examined by RNA-seq (E19, E22, PO, P2), associated with large 
changes in gene expression profiles. Therefore to examine the relationship between 
light and dark stripes at different stages, we developed supervised learning mod- 
els in which the gene expression profile at one stage was tested as a predictor 
of stripe identity, light versus dark, and subsequent stages. After normalization 
and variance stabilization using R functions implemented in DESeq2, we carried 
out a principal component analysis (PCA), using R functions implemented in the 
FactoMineR (version 1.33) package, to identify variance components associated 
with light versus dark phenotype across all stages. The PCA results then were used 
to develop supervised learning models using the R functions implemented in the 
randomForest package (version 4.6-12), including optimization steps based on the 
top 5% of the most informative gene expression profiles associated with each stage. 
The results demonstrate the ability of learning models based on a specific region 
and stage to predict region identity, light versus dark, in other stages, in which the 
accuracy of the models is evaluated by averaging over 30 independent iterations 
(Extended Data Fig. 2f). 
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In situ hybridizations. For in situ hybridization, we generated species-specific 
riboprobes by cloning a 545-bp fragment of Alx3 from M. musculus and the striped 
mouse. We carried out section in situ hybridizations following protocols described 
previously**”” and visualized samples using an Al Imager (Zeiss). All pictures are 
representative of at least three individuals. 
Cell culture experiments. We purchased B16-F1 melanoma cells from ATCC and 
maintained them in DMEM with 10% fetal bovine serum (FBS, Sigma-Aldrich), 
100U ml ' penicillin, and 100,.g ml! streptomycin in a 37 °C incubator with 5% 
CO), at physiological pH 7.4. We grew cells to 70-80% confluency and performed 
all experiments within 10 passages. B16-F1 cells tested negative for Mycoplasma 
contamination. We cultured mouse keratinocytes and maintained them in 0.05 mM 
Ca** E-medium with 10% FBS serum, following previously established protocols!®. 
For gain-of-function experiments, we used the LV—Alx3:GFP and LV-GFP 
constructs described above. For loss-of-function experiments, we used five con- 
structs (four specific to Alx3 and one scrambled sequence) from existing RNAi 
lentiviral libraries"! (details and clone numbers listed in Supplementary Table 3). 
For viral infections, we plated cells in 6-well dishes at 300,000 cells per well and 
incubated with lentivirus in the presence of polybrene (100,.g m7). All infections 
were carried out in triplicate. After two days in culture, we selected infected cells 
using either puromycin (21g ml~}; shRNA constructs) or FACS (gain-of-function), 
and processed samples for mRNA analyses. For cell-culture-insert experiments, 
we plated wild-type cells on 0.4,1m transwell inserts (Falcon, BD) at 200,000 cells 
per ml and incubated them in plates containing a bottom layer of transduced cells 
(keratinocytes or melanocytes) for three days (see Extended Data Fig. 7a, c for an 
illustration of the experimental design). 
Ultrasound-assisted in utero lentiviral microinjections. For construction of 
LV-Alx3:GEP, we replaced the puromycin cassette of PLKO.1, a generic lentiviral 
vector containing the PKG promoter“, with a fragment containing Alx3 cDNA 
cloned from M. musculus (NM_007441.3), a P2A sequence and a histone-fluorescent 
protein gene fusion (Hist2h2be-eGFP). LV-GEP, which contains only the sequence 
coding for Hist2h2be-eGEP, was originally designed from PLKO.1 using a similar 
strategy'® (Addgene plasmid 25999). We carried out large-scale production of VSV-G 
pseudotyped lentivirus using calcium phosphate transfections of 293FT cells and 
helper plasmids, pMD2.G and psPAX2 (Addgene plasmids 12259 and 12260). 
Transfection conditions, subsequent viral concentration and titration followed 
established guidelines!®, For injections, we anaesthetized C57B1/6 females at E8.5 of 
gestation and injected embryos with 1.5, ofa constant viral titre. We collected trans- 
duced skin from P4 pups, which we used for immunohistochemistry, following pro- 
cedures outlined above, and for isolation of virus-infected melanocytes via FACS. For 
FACS, we used a KIT antibody (ebioscience 14-1171-81; 1:1,000), sorted KIT* GFP 
cells directly in TRIzol LS (Invitrogen), and extracted RNA following the protocol 
outlined in the TRIzol LS manual. We manually quantified the number of MITF* 
GFP* and SOX10* GFP* cells in hair follicles, as detected with our antibodies. To 
determine the number of keratinocytes per follicular area, we divided the number of 
K14* GFP* cells in hair follicles by the hair bulb area, using ImageJ*°. To determine 
hair follicle density, we obtained pictures of skin sections stained with DAPI and 
counted the number of hair follicles per tissue section area, using Image]*°. Data were 
obtained from three individuals per lentiviral construct injected. All counts were 
done blind and samples were randomized. Statistical differences were established 
using two-tailed t-tests (sample sizes for each analysis are indicated in figure legends). 
Comparative sequence analysis. We obtained comparative sequence data from 
publicly available nucleotide databases at NCBI (http://www.ncbi.nlm.nih.gov/ 
BLAST/). Evolutionarily conserved non-coding sequences were identified using 
the global sequence alignment tool incorporated in the UCSC genome browser 
(http://genome.ucsc.edu)”, PipMaker (http://bio.cse.psu.edu/pipmaker)*’, and 
LAGAN (http://lagan.stanford.edu/lagan_web/index.shtml)™. 
Electrophoretic mobility shift assays (EMSA). We conducted EMSA using 
nuclear extracts’ of melanoma B16-F1 cells prepared in the presence of protease 
inhibitors (complete protease inhibitor cocktail; Roche) and determined protein 
concentrations using the Bio-Rad protein assay. Synthetic complementary 
oligonucleotides were annealed and labelled using [-**P]-ATP and T4 kinase. 
We performed binding reactions at room temperature in the presence of 
20,000 counts per min (c.p.m.) of radiolabelled probe (approximately 6-10 fmol) 
in a volume of 2011 containing 2 1g poly (dI-dC), 20mM HEPES (pH 7.9), 70 mM 
KCl, 2.5mM MgCh, 1 mM dithiothreitol, 0.3 mM EDTA and 10% glycerol. We then 
added competitor oligonucleotides of identical (specific) or unrelated (non-specific) 
sequences to the probe at the indicated fold molar excess. The sequences of the 
oligonucleotides used are listed in Supplementary Table 2. When indicated, 
we added specific antiserum” or control non-immune rabbit serum (NRS) to the 
binding reaction. The reaction mixtures were resolved using 5% non-denaturing 
polyacrylamide gels, which were subsequently dried and autoradiographed. 
Western blots. We prepared cell lysates from B16-F1 cells, resolved by SDS- 
PAGE, and blotted them onto a BioTrace PVDF membrane (Pall Corporation). 
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We detected ALX3 immunoreactivity with a rabbit polyclonal primary antiserum 
(1:4,000 dilution)” and a horseradish peroxidase-conjugated goat anti-rabbit 
secondary antibody (1:10,000 dilution; Bio-Rad Laboratories). To detect ACTIN 
we used a mouse monoclonal antibody (1:10,000 dilution, clone AC-15; Sigma) and 
a horseradish peroxidase-conjugated goat anti-mouse antibody (1:5,000 dilution; 
Bio-Rad Laboratories). We visualized immunoreactive bands using an ECL detection 
system (GE Healthcare). 

Chromatin immunoprecipitation (ChIP)-qPCR assays. We performed ChIP assays 
as previously described** using B16-F1 cells treated with 1% formaldehyde. We isolated 
the cross-linked protein-DNA complexes and, after sonication, we incubated chro- 
matin with an ALX3 antiserum”! or with control NRS. Next, we isolated antibody- 
protein-DNA complexes by incubation with protein A-sepharose. To detect bound 
DNA, we carried out qPCR on triplicate samples using oligonucleotide primers that 
amplify fragments of the Mitf gene corresponding to the regions containing sites 3, 
5 or 10. As acontrol, we used promoter sequences from the Tyr gene as described”. 
Oligonucleotides used in ChIP assays are listed in Supplementary Table 2. 
Luciferase assays. We amplified a 1.5-kb region of the Mitf M promoter from 
M. musculus and subsequently cloned it into the SacI-HindIII sites of the 
pLightSwitch_Prom luciferase reporter vector (Switchgear Genomics, Active 
Motif). We then generated additional luciferase constructs from the wild-type 
construct by mutating the different Alx3 binding motifs (TAAT to GCCG) using 
the Q5 site-directed mutagenesis kit (New England Biolabs). We verified all con- 
structs by sequencing. We next transfected B16-F1 melanocytes with LV—Alx3:GFP 
and LV-GFP by using FuGENE HD (Active Motif). Using FACS, we selected the 
stable transfected clones and confirmed overexpression of Alx3 by qPCR. The day 
before the transfection, we seeded cells at a density of 1 x 10* cells per well and 
16h later transfected them with the different Mitf constructs using a FuGENE 
HD to plasmid DNA ratio of 3:1 (300 nl FuGENE HD to 100ng plasmid DNA 
per well). We then harvested cells and processed them using the LightSwitch 
luciferase assay kit (Switchgear Genomics) following the protocol guidelines and 
measured luciferase using a SpectraMax L luminometer (Molecular Devices). We 
normalized luciferase activity relative to luminescence from cells transfected with 
the pLightSwitch_Prom luciferase reporter vector (empty vector). We did not 
observe a difference in luciferase activity when we transfected our two stable cell 
lines (LV—Alx3:GFP and LV-GFP) with an empty vector (pLightSwitch_Prom) 
or a vector containing the promoter for a housekeeping gene (ACTB_PROM). 
We performed all luciferase experiments using five replicates per construct and 
established the statistical significance of luminescence differences using two-tailed 
t-tests (sample sizes for each experiment are indicated in the figure legends). 
Chipmunk (T. striatus) samples. We collected T. striatus at Harvard University’s 
Concord Field Station (Concord) using Sherman live traps (Massachusetts state 
permit: 027.14SCM). Chipmunks were euthanized, skin punches were taken from 
the different body regions, and samples were processed for qPCR as indicated above. 
Data availability. The striped mouse de novo transcriptome assembly has 
been deposited in https://datadryad.org/ under accession number doi:10.5061/ 
dryad.7v222. 
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Extended Data Figure 1 | Hair characterization in adult striped mice. a, Striped mice have three different phenotypic categories of hair (light, black 
and banded) based on individual pigment pattern. All hair types have a black tip, which corresponds to structural hair features (not pigment). b, Relative 
proportion of light, black and banded guard and awl hair along the striped mouse dorsoventral axis (n = 5; error bars represent s.e.m.). a, Scale bar, 100 um. 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Stripe-like differences in hair length along 
the dorsum in striped mouse embryos and pups. a-c, Flat-mount skin 
preparations (dermis up) of embryos at E16 (a) and E19 (b) and P2 pups (c). 
Middle axis is indicated in all cases (midline). White dashed lines mark 
regions differing in hair length at E19 (b) and regions differing in 
pigmentation at P2 (c). Incipient pigmentation stripes are shown in (b). 
d, Skin punches (1 mm) and length measurements show differences 
between hair in the dark and light stripe of P2 individuals. Hair length 
differences in (b) (incipient stripes) correlate with those seen when 
pigment differences arise (c, d). Differences among dorsal regions were 
evaluated by ANOVA followed by a Tukey—Kramer test; n = 15 per 
region; **P < 0.01; ***P < 0.001. Red lines depict the mean. e, Hair 
length measurements taken from guard, awl and zigzag hair found along 
the dorsum of adults. Differences among dorsal regions were evaluated 


by ANOVA followed by a Tukey-Kramer test; n = 24 (guard), 12 (awl) 
and 12 (zigzag) per region; P= 0.1736 (guard hair), P= 0.8006 (awl 
hair), P= 0.1038 (zigzag hair). Red lines depict the mean. f, Predicted 
probabilities of the observed stripe-like phenotypes, as inferred by 
supervised learning models built and trained to recognize time-point- 
specific gene expression signatures of the stripes. Bars reflect average 
probabilities + s.e.m. computed from 30 consecutive iterations of the 
predictive model in each analysis. Labels indicate, as a ratio, the time point 
at which the randomForests model was applied to predict the stripe-like 
phenotype (the left term of the ratio), and the time point at which the 
model was built and trained (the right term of the ratio). The dotted line 
indicates the prior probability of either stripe phenotype (that is, 50% in 
case of only two distinct phenotypes). See Methods for details. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Epidermis 
o 
2 
Si 
x} 
x 
= 
o § 
By 
o 
2 
+S 
7 
= 
fo.) 
gs 
7) NS 
g 40 
+ 
Fe 
fin} g 30 
Bs 
ge” 
ge 
0 3 
So 10 
® 
2 
Eo 
2 Dark Stripe Light Stripe 
b 
7 
a) 
2 
2 § 
££ Oo 
Eg 
o£ 
a3 
E 
= 
Zz 


Stripes 


Extended Data Figure 3 | Cell proliferation and hair follicle density in 
P2 striped mice. a, Counts of proliferating cells, as determined by EdU 
labelling, in the epidermis and inside hair follicles (epidermal cells, dark 
stripe 1 versus light stripe, two-tailed t-test; n = 15 images per region; 
P=0.5417; cells counted: 402 (dark stripe 1) and 444 (light stripe); 
intrafollicular cells, dark stripe 1 versus light stripe, two-tailed t-test; 

n= 12 images per region; P= 0.7537; cells counted: 724 (dark stripe 1) 
and 680 (light stripe)). Dotted lines delineate the epidermal compartment 
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Inside hair follicle 


QD 
oO 


Number of Edu+ cells/ 
Hair follicle 
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and the hair bulb. b, Number of hair follicles per surface area along the 
dorsoventral axis. Differences among dorsal regions were evaluated by 
ANOVA; n = 10 images per region; P = 0.4391; number of hair follicles 
counted: 139 (light stripe), 141 (dark stripe 1), 132 (dark stripe 2) and 

128 (flank). Bright field images in a depict pigment. Boxes represent the 
25th to 75th percentile, whiskers show the minimum and maximum value, 
and the horizontal line indicates the median. Red bars indicate the mean. 
Scale bars in a 100,1m, b 200m. 
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Extended Data Figure 4 | RNA-seq analysis. a-d, Venn diagrams showing _ expression in the light stripe) or blue (higher expression in the flank). 
numbers of differentially expressed genes identified using either the Four differentially expressed pigmentation-related genes are highlighted 
M. musculus reference or the striped mouse de novo transcriptome (dark yellow or dark blue), while 11 additional pigmentation-related genes 
assembly in light versus dark stripes (a), light versus flank (b), and flank that are not differentially expressed are shown in pink (Supplementary 
versus dark (c), or differentially expressed genes in light or dark stripes Table 1b). f, The 1,148 genes demonstrating significant (FDR<0.1) 
versus the other skin region (light or dark stripes and the flank) (d). differential expression in the flank versus the dark stripe are shown in 
Genes that are specifically upregulated only in the dark or light stripes yellow (higher expression in the flank) or blue (higher expression in 
are highlighted in red. e, f, RNA-seq transcript levels (normalized gene the dark stripe). Eleven differentially expressed pigmentation-related 
counts) plotted as a function of differential expression (log, fold-change). genes are highlighted (dark yellow or dark blue), while six additional 
e, The 1,777 genes demonstrating significant (FDR < 0.1) differential pigmentation-related genes that are not differentially expressed are shown 
expression in the light stripe versus the flank are shown in yellow (higher in pink (Supplementary Table 1c). 
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switching genes Asip (a), Edn3 (b), and melanin synthesis genes Tyr and point; *P <0.05; **P < 0.01; ***P < 0.001. Red bars indicate the mean. 


Tyrp1 (c) in different regions of the striped mouse skin and at different 
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a generic vector for expressing human RNU6-1 promoter-driven short Supplementary Fig. 1. c, d, Quantitative PCR of Alx3 (white), Mitf (black) 
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Extended Data Figure 7 | Co-culture experiments. a—d, Wild-type B16 mRNA in B16 wild-type melanocytes exposed to keratinocytes (b) or 
melanocytes (B16 wild-type (WT)) were exposed to keratinocytes (a) or melanocytes (d) transduced with LV-Alx3:GFP (blue panels) or LV-GFP 
melanocytes (c) stably transduced with either LV-Alx3:GFP (LV-Alx3 in (red panels) (LV-Alx3:GFP versus LV-GFP, two-tailed t-test; n = 3, 
graphs) or LV-GFP. b, d, Quantitative PCR of levels of Alx3 mRNA incells — ***P < 0.001; NS, not significant). Red bars in b and d indicate the mean. 
carrying the lentiviral constructs (grey panel) and of Mitf, Tyr and Silver 
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Extended Data Figure 8 | Ultrasound-guided in utero lentiviral 
injections. a-f, Hair follicles from embryos injected at E8.5 with 
lentiviruses stained for SOX10 (a, d), virus transduced cells (b, e) and 
merged images with arrowheads showing SOX* GFP* cells (c, f). Dotted 
lines (a-f) delineate the hair bulb. g, Number of detectable SOX10* cells 
(LV—Alx3:GFP versus LV-GEP, two-tailed t-test; n = 60; P= 0.1173; cells 
counted: 426 cells (LV—Alx3:GFP) and 398 cells (LV-GFP)). h-o, Effect 

of Alx3 on skin. Hair follicles from samples injected with LV-GFP control 
and LV-Alx3:GFP depicting immunohistochemistry for K14 (h, k), virus 
transduced cells (i, 1), and merged images showing K14* GFP* cells (j, m). 
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n, Number of detectable K14* GFP* cells per follicular area (LV—Alx3:GFP 
versus LV-GFP, two-tailed t-test; n = 40, P= 0.275; average: 52.809 cells 
per hair follicle area (LV-Alx3:GFP) and 55.123 cells per hair follicle 

area (LV-GFP)). 0, Hair follicle density in samples injected with viruses 
(LV-Alx3:GFP versus LV-GEP, two-tailed f-test; n = 30; P= 0.103; average: 
0.84 hair follicles per surface area (LV—Alx3:GFP) and 0.794 hair follicles 
per surface area (LV-GFP)). Boxes represent the 25th to 75th percentile, 
whiskers show the minimum and maximum value, and the horizontal line 
indicates the median. Scale bars in a-f and h-m are 501M. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys PATA A SATA | CHAATG? {| Conserved in mammals 


us 


Rhabdomys Al ¢ \ Mi EMSA probe 


us & 
Rhabdomys ATCACAG ATT TGCATGCTTCC sso: ©) Transcription start site 


us 


TT TTT TCWTT TT TCT MT ( C BAG CTGTCTGT( 
Rhabdomys SOTTTTCerTTT \ \ Cc i Conserved in Mus and 


striped mice 


us CTGTCTG 
Rhabdomys 


us 
Rhabdomys 


us 
habdomys 


ZS) 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
habdomys 


es) 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


us 
Rhabdomys 


Extended Data Figure 9 | Alignment of a ~1.5-kb region of the Mitf (http://genome.ucsc.edu) (yellow), regions from which the EMSA probes 
M promoter in M. musculus and striped mouse. Black boxes represent were designed (red), and the transcription start site (green). The ten TAAT 
conserved sequences. Mapped onto the sequences are evolutionary binding sites, conserved between M. musculus and striped mice (blue), 
conserved regions of the mammalian Mitf M promoter identified in silico which were tested are labelled sequentially. 
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Extended Data Figure 10 | EMSA and luciferase assays. a, EMSAs show 
the binding of nuclear proteins from B16-F1 cells to candidate-binding 
sites 3, 5 and 10. The absence (—) or presence of non-specific (NSC; 500- 
fold molar excess) or specific (SC; indicated fold molar excess) competing 
oligonucleotides, or the addition of ALX3 antibodies or control (NRS or 
IgG) is indicated. Arrows indicate complexes containing ALX3, arrowhead 
shows supershift for site 3 and asterisk highlights an artefact in the gel. 

b, EMSAs showing the binding of recombinant Alx3 synthesized using 

a rabbit reticulocyte lysate system to the indicated sites (sites 4-9). The 
absence (—) or presence of non-specific (NSC) or specific (SC) competing 
oligonucleotides (500-fold molar excess) is indicated. Note that site 5 is the 


Site 4 Site 5 Site 6,7 Site8 Site9 


only one showing sequence-specific binding. For gel source images, 

see Supplementary Fig. 1. c, Relative levels of luciferase activity in B16-F1 
cells stably expressing GFP (dark circles) or Alx3 (light circles) for 
indicated binding sites. Luciferase activity was normalized relative to 
cells transfected with the empty reporter vector and values are given as a 
fraction of luminescence for GFP-transfected cells. Differences between 
cells transfected with LV-Alx3:GFP and LV-GFP for each plasmid were 
evaluated using two-tailed t-tests; n = 5; *P < 0.05; NS, not significant. 
Red lines depict the mean. Labels of mutated binding sites correspond to 
those described in Fig. 4a. 
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Bacteria establish an aqueous living 
space in plants crucial for virulence 


Xiu-Fang Xin!, Kinya Nomura!, Kyaw Aung!, André C. Velasquez!, Jian Yao!+, Freddy Boutrot?, Jeff H. Chang’, 


Cyril Zipfel? & Sheng Yang Hel?» 


High humidity has a strong influence on the development of numerous diseases affecting the above-ground parts of 
plants (the phyllosphere) in crop fields and natural ecosystems, but the molecular basis of this humidity effect is not 
understood. Previous studies have emphasized immune suppression as a key step in bacterial pathogenesis. Here we 
show that humidity-dependent, pathogen-driven establishment of an aqueous intercellular space (apoplast) is another 
important step in bacterial infection of the phyllosphere. Bacterial effectors, such as Pseudomonas syringae HopM1, 
induce establishment of the aqueous apoplast and are sufficient to transform non-pathogenic P. syringae strains into 
virulent pathogens in immunodeficient Arabidopsis thaliana under high humidity. Arabidopsis quadruple mutants 
simultaneously defective in a host target (AtMIN7) of HopM1 and in pattern-triggered immunity could not only be used 
to reconstitute the basic features of bacterial infection, but also exhibited humidity-dependent dyshomeostasis of the 
endophytic commensal bacterial community in the phyllosphere. These results highlight a new conceptual framework 
for understanding diverse phyllosphere-bacterial interactions. 


The terrestrial phyllosphere represents one of the most important hab- 
itats on Earth for microbial colonization. Although the vast majority 
of phyllosphere microbes exhibit benign commensal associations and 
maintain only modest populations, adapted phyllosphere pathogens 
can multiply aggressively under favourable environmental conditions 
and cause devastating diseases. In crop fields, phyllosphere bacterial 
disease outbreaks typically occur after rainfall and a period of high 
humidity, consistent with the ‘disease triangle’ (host-pathogen- 
environment) dogma formulated more than 50 years ago*. The molec- 
ular basis of the effect of high humidity on bacterial infection of the 
phyllosphere is not understood. 

Many plant and animal pathogenic bacteria, including the model 
phyllosphere bacterial pathogen Pseudomonas syringae, carry a type 
III secretion system (T3SS), which is used to deliver disease-promoting 
‘effector’ proteins into the host cell as a primary mechanism of 
pathogenesis™®. Studies of how individual type III effectors promote 
bacterial disease in plants and animals have shown that effector- 
mediated suppression of host immunity is a common occurrence in 
both plant-bacterial’~? and animal-bacterial interactions!°!!. However, 
owing to the apparent molecular complexities in bacterial diseases, 
which host processes must be impaired for basic bacterial pathogenesis 
to occur remains unknown in any plant or animal pathosystem. 


Immunosuppression and pathogenesis 

To investigate whether host immunity may be the only process 
that needs to be impaired for bacterial pathogenesis to occur in the 
phyllosphere, we performed infection assays in Arabidopsis polymu- 
tants that were severely defective in multiple immune pathways: (i) fls2 
efr cerk1 triple mutant (fec hereinafter), which is mutated in three major 
pattern recognition receptor (PRR) genes relevant to P. syringae pv. 
tomato strain DC3000 (Pst-DC3000) infection”; (ii) bak 1-5 bkk1-1 cerk1 
triple mutant (bbc hereinafter; see Methods), which is compromised 


in immune signalling downstream of multiple PRRs!*"!4; and 


(iii) dde2 ein2 pad4 sid2 quadruple mutant (deps hereinafter), which 
is defective in all three major defence hormone pathways (salicylic 
acid, jasmonate and ethylene)'*. Two nonpathogenic mutant deriva- 
tives of Pst-DC3000 were used: the hrcC™ mutant (defective in type 
III secretion)! and the DC3000428E mutant, in which the T3SS 
remains intact, but 28 of 36 type ITI effectors (A28E) are deleted’”. 
The hrcC~ and DC300047*= mutants grew very poorly not only in 
wild-type A. thaliana Columbia-0 (Col-0) (Fig. 1a), but also in immuno- 
compromised mutants when infiltrated into the apoplast, suggesting 
that host immunity is unlikely to be the only process impaired by 
Pst-DC3000 during infection. 


High humidity is required for pathogenesis 

During the active pathogenesis phase, phyllosphere bacterial patho- 
gens such as Pst-DC3000 live mainly in the air-filled apoplast, which is 
connected directly to open air through epidermal pores called stomata. 
The water status inside the apoplast could therefore be influenced by 
air humidity during pathogen infection. In crop fields, phyllosphere 
bacterial disease outbreaks typically occur after rainfall and a period of 
high humidity'*'8, consistent with the ‘disease triangle’ dogma in plant 
pathology. In addition, one of the earliest and most common symptoms 
of phyllosphere bacterial diseases is the appearance of ‘water soaking’ 
in infected tissues, although whether water soaking has an active role 
in bacterial pathogenesis remains unclear. Whereas Pst-DC3000 mul- 
tiplied to high numbers under high humidity conditions (approxi- 
mately 95%; mimicking high humidity after rainfall in crop fields), 
it multiplied much less under low humidity conditions (<60%) (Fig. 
1b), as reflected also by lower disease severity (Fig. 1c). The ability of 
Pst-DC3000 to multiply increased as humidity rose. By contrast, the 
hrcC~ mutant multiplied poorly under all humidity conditions tested 
(Fig. 1d). The most aggressive infection by Pst-DC3000 was associ- 
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Figure 1 | Full-scale Pst-D C3000 infection requires high humidity 

and is tightly associated with apoplast ‘water soaking’ a, Bacterial 
populations in Col-0, fec, bbc and deps leaves 2 days after infiltration with 
bacteria at 1 x 10° cfu ml~!. Humidity: ~95%. Two-way ANOVA with 
Tukey’s test (significance set at P< 0.05) was performed. No significant 
differences were found for DC3000 populations in different plant 
genotypes (indicated by the same letter a), whereas differences were found 
for the hrcC~ or DC30004** populations in different plant genotypes, as 
indicated by different letters of the same type (a’ versus b’ for hrcC~ and 
a” versus b’’ for DC3000%**), n = 4 technical replicates; data are shown 
as mean + s.d. Experiments were repeated three times with similar results. 
b, c, Bacterial populations (b) and disease symptoms (c) 3 days after 
infiltration with Pst-DC3000 at 1 x 10° cfu ml”. Significant difference 
was determined by Student's t-test (two-tailed); ***P= 1.08 x 10~°. 

n= 4 technical replicates; data are shown as mean + s.d. Experiments 
were repeated four times with similar results. RH, relative humidity. 

d, Bacterial populations in Col-0 leaves 3 days after infiltration with 
bacteria at 1 x 10° cfu ml“. Statistical analysis as in a. DC3000 
populations showed significant differences under different humidities 
(as indicated by different letters) whereas hrcC” populations did not 
(indicated by a’). n=3 technical replicates; data are shown as mean +s.d. 
Experiments were repeated three times with similar results. e, The 
abaxial sides of Col-0 leaves 24h after infiltration with Pst-DC3000 at 

1 x 10° cfu ml~!. Humidity: ~95%. Dark spots on the leaf indicate water- 
soaking spots. Red boxes indicate enlarged regions. f, A tomato leaf 

(cv. Castlemart) 3 days after infiltration with Pst-DC3000 at 

1 x 104 cfu ml~’. Humidity: ~95%. Yellow circles in e and f indicate 
infiltration sites. Images are representative of water-soaked leaves from 
more than four plants. g, Col-0 plants were dip-inoculated with 

bacteria at 2 x 10° cfu ml~!. Humidity: ~95%. Bacterial colonies in 
inoculated leaves were visualized 2 days later by a charge-coupled device 
(top) and images of leaves were taken to show water soaking spots 
(middle). Bottom, merged images, with artificial red colour labelling 
Pst-DC3000-lux bacteria. Experiments were repeated three times. Images 
are representative of leaves from more than four plants. 
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ated with the appearance, usually within one day after infection, of 
water soaking in the infected Arabidopsis leaves under high humidity 
(Fig. le). Water-soaked spots could also be observed in Pst-DC3000- 
infected leaves of another host species, Solanum lycopersicum cv. 
Castlemart (Fig. 1f). Real-time imaging (Supplementary Video 1) 
showed that the initial water-soaked spots marked the areas of later 
disease symptoms (necrosis and chlorosis). It also showed that water 
soaking was a transient process, which disappeared before the onset 
of late disease symptoms. Using a Pst-DC3000 strain tagged with a 
luciferase reporter (DC3000-Iux!°), we found that water-soaking areas 
and luciferase signals were detected non-uniformly across the leaf, but 
they overlapped extensively (Fig. 1g and Extended Data Fig. 1), showing 
that water-soaked areas are where bacteria multiply aggressively in the 
phyllosphere before the onset of late disease symptoms. 


P. syringae water -soaking effectors 

The DC3000“7*¥ mutant did not cause water soaking under any exper- 
imental conditions (for example, high humidity or inoculum). We 
therefore transformed each of the 28 Pst-DC3000 effector genes back 
into the DC3000%"* mutant individually, to identify the effector(s) 
that confer the ability to cause water soaking. Most effectors did not 
(see Fig. 2a for avrPto, as an example), except for hopM1 and avrE1, 
together with their respective type III secretion chaperones shcM and 
avrF (Fig. 2a). This was noteworthy, because although HopM1 and 
AvrE1 show no sequence similarity, they were previously shown to be 
functionally redundant in virulence and they are highly conserved in 
diverse P. syringae strains and/or other phytopathogenic bacteria”! 
Moreover, transgenic overexpression of 6x His:HopM1 (ref. 22) or 
6x His:AvrE1 (ref. 23) in Arabidopsis, under the control of a 
dexamesathone-inducible promoter (101M), also caused water soaking 
under high humidity (Fig. 2b). By contrast, transgenic expression of 
AvrPto (ref. 29), like the DC300047*(avrPto) strain, did not induce water 
soaking. These results show that HopM1 and AvrE1, expressed either 
by bacteria or by transgenic overexpression in plant cells, are each 
sufficient to cause water soaking. 

Bacterial mutant analysis showed that the hopM1 and avrE1 genes 
are necessary for Pst-DC3000 to cause water soaking during infection, 
whereas the avrE1~/hopM1~ double mutant” could not cause water 
soaking, even when the inoculum of the avrE1~/hopM1~ mutant was 
adjusted to reach a similar population as Pst-DC3000 when water soak- 
ing was assessed (Fig. 2c). By contrast, Pst-DC3000 and the avrE1~ 
and hopM1~ single mutants”° caused strong initial water soaking 
(Extended Data Fig. 2a) and later disease symptoms (Extended Data 
Fig. 2b) and multiplied aggressively in a high-humidity-dependent 
manner, whereas the avrE1~/hopM1~ double mutant multiplied poorly 
regardless of humidity conditions (Fig. 2d). Transgenic expression of 
6 x His: HopM1 in Arabidopsis (0.1 nM was used to induce low-level 
expression of HopM1 so that HopM1 alone does not cause extensive water 
soaking) greatly enhanced the ability of the avrE1~/hopM1~ double 
mutant to cause water soaking and multiply extensively under high 
humidity conditions (Extended Data Fig. 2c, d). These results showed 
that, unlike the other 34 effectors present in the avrE1~/hopM1~ double 
mutant, the full virulence functions of HopM1 and AvrE1 are uniquely 
dependent on external high humidity. 

Next, we investigated why the virulence functions of HopM1 and 
AvrE1 were dependent on external humidity. We hypothesized that 
the primary function of HopM1 and AvrE1 is for creating an aqueous 
apoplast (that is, bacteria ‘prefer’ to live in an aqueous environment in 
the apoplast), the maintenance of which requires high humidity as the 
leaf apoplast is directly connected to open air through stomata. If so, 
it may be possible to substitute the function of HopM1 and AvrE1 by 
simply providing water to the apoplast. To test this hypothesis directly, 
we performed transient water supplementation experiments in which 
Col-0 plants infiltrated with the avrE1~/hopM1~ mutant were tran- 
siently kept water-soaked, for the first 12-16 h to mimic the kinetics 
of transient water soaking that normally occur during Pst-DC3000 
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Figure 2 | Type III effectors AvrE1 and HopM1 are necessary and 
sufficient to cause water soaking. a, Col-0 leaves 24h after infiltration 
with bacteria (1-2 x 10° cfu ml~'). Humidity: ~95%. b, Leaves of 
transgenic 6 x His: HopM1 (ref. 22), 6 His:AvrE1 (ref. 23) or AvrPto (ref. 29) 
plants after spraying with 101M dexamethasone (DEX; to induce effector 
gene expression). Humidity: ~95%. Col-0 or Col-0 gl1 (glabra1) plants 
were non-transgenic parental controls. Images are representative of leaves 
from more than four plants. c, Col-0 leaves (left) and bacterial populations 
(right) 24h after infiltration with Pst-DC3000 (1 x 10° cfu ml“) or the 
avrE1/hopM1- strain (1 x 10’ cfu ml“). Humidity: ~95%. Student's 
t-test (two-tailed); NS, not significant (P = 0.104). n =6 technical 
replicates from three independent experiments (n = 2 in each experiment); 
data are shown as mean + s.d. d, Bacterial populations in Col-0 

plants 3 days after infiltration with bacteria at 2 x 10° cfu ml"!. 

*§D = 1.07 x 107%, 8.07 x 1077 and 5.95 x 1077 for DC3000, the avrE1~ 
mutant and the hopM1~ mutant, respectively, for differences in bacterial 
population between different humidities, as determined by Student’s t-test 
(two-tailed); NS, not significant (P = 0.13). n=4 technical replicates; 

data are shown as mean + s.d. Experiments were repeated three times. 

e, f, Bacterial populations (e) and images (f) of Col-0 leaves 3 days after 
infiltration with bacteria at 1 x 10° cfu ml~!. In the ‘—H,O’ treatment, 
plants were air-dried normally (for about 2h) and then kept under high 
humidity (approximately 95%). In the “+H2O’ treatment, plants were 

kept under high (80-95%) humidity after syringe-infiltration to allow 
slow evaporation of water (for about 16h, until no visible apoplast water 
was seen). **P= 8.29 x 10-3 and 1.14 x 1077 for DC3000 and hrcC~, 
respectively and ***P=7.61 x 10-7 and 9.82 x 10-* for avrE1- /hopM1~ 
and CUCPB5452, respectively, indicate significant differences between 
—H,0 and +H,0 treatments as determined by Student's t-test (two-tailed). 
n= 3 technical replicates; data are shown as mean + s.d. Experiments were 
repeated three times. 


infection (Supplementary Video 1). Transient apoplast water supple- 
mentation was sufficient to restore the multiplication (100- to 1,000- 
fold) of the avrE1~/hopM1~ mutant almost to the level of Pst-DC3000 
(Fig. 2e), as well as to induce appearance of severe disease symptoms 
(Fig. 2f). By contrast, the controls, Pst-DC3000, the hrcC™ mutant and 
CUCPB5452 (which expresses the avrE1 and hopM1 genes but has 
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Figure 3 | Effects of AUMIN7 and effector-triggered immunity on water 
soaking. a, atmin7 leaves, but not Col-0 leaves, showed partial water 
soaking 48 h after dip-inoculation with the avrE1~/hopM1~ mutant at 

1 x 108 cfu ml~!. Humidity: ~95%. Water soaking disappeared after 
transition to low humidity (~25%) to allow evaporation of apoplast 
water. Images are representative of leaves from more than four plants. 

b, c, ETI prevents apoplast water soaking. Col-0 and rps2 leaves were 
infiltrated with Pst-DC3000 (1 x 10° cfu ml’) or Pst-DC3000(avrRpt2) 
(1 x 10” cfu ml for Col-0 and 1 x 10° cfu ml“! for rps2 plants). Plants 
were kept under high humidity (~95%) for 24h to observe water soaking 
and then shifted to low humidity (~50%) for 4h to observe ETI-associated 
tissue collapse. Images were taken before and after low humidity exposure 
(b) and bacterial populations were determined 24h after infiltration to 
show similar population levels (c). Statistical analysis of data in c was 
performed by one-way ANOVA with Tukey’s test (significance set at 
P<0.05), and no significant difference was detected. n =3 technical 
replicates; data are shown as mean + s.d. Experiments were repeated 
three times. d, AtMIN7 protein is stabilized during ETI as revealed 

by immunoblot. Col-0 or atmin7 leaves were infiltrated with bacteria 

(1 x 107 cfu ml~!; ref. 25) or H2O and kept under high humidity (~95%) 
for 24h before protein extraction. Asterisk indicates a non-specific band. 
Coomassie blue staining shows equal loading. See Supplementary Fig. 1 for 
uncropped images. 


much reduced virulence owing to deletion of other type III effectors”) 
grew only slightly better (<10 fold) with transient water supplemen- 
tation (Fig. 2e). These results demonstrate that the primary virulence 
function of HopM1 and AvrE] can be effectively substituted by tran- 
siently supplying water to the apoplast. 


The host target of HopM1 for modulating water soaking 

To investigate the mechanism by which HopM1 creates the aqueous apo- 
plast, we focused on the host targets of HopM1 in Arabidopsis. We have 
previously shown that HopM1 is targeted to the trans-Golgi-network- 
early endosome (TGN/EE) in the host cell and mediates proteasome- 
dependent degradation of several host proteins, including AtMIN7, 
which is a TGN/EE-localized ADP ribosylation factor-guanine 
nucleotide exchange factor involved in vesicle trafficking””?>®. 
Although the atmin7 mutant plant showed partially increased bacterial 
multiplication”, the exact role of AtMIN7 during pathogen infec- 
tion remains unknown. A previous study showed that the virulence 
function of HopM1 is fundamentally different from that of canonical 
immunosuppressing effectors, such as AvrPto!”. As our data showed a 
role for HopM1 in creating water soaking, we tested the possibility that 
AtMIN7 may modulate apoplast water soaking in response to bacte- 
rial infection. Notably, we found that apoplast water soaking occurred 
in the atmin7 mutant plant in the absence of HopM1/AvrE1 (that is, 
during infection by the avrE1~/hopM1~ mutant; Fig. 3a and Extended 
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Figure 4 | hopM1/shcM transform the non-pathogenic pene! 
mutant into a highly virulent pathogen in PTI-deficient mutant plants 
in a humidity-dependent manner. a, b, Bacterial populations (a) and 
disease symptoms (b) 3 days after infiltration with the indicated bacteria 
at 1 x 10° cfu ml~’. Humidity: ~95%. Statistical analysis was performed by 
one-way ANOVA with Tukey’s test (significance set at P< 0.05). Bacterial 
populations indicated by different letters (a, b and c) are significantly 
different (ab is not significantly different from a or b). n=4 technical 
replicates; data are shown as mean + s.d. Experiments were repeated three 
times. c, Water-soaking symptoms were recorded 24h after inoculation. 

d, Bacterial populations 3 days after infiltration with DC300047*(hopM1/ 
shcM) at 1 x 10° cfu ml“! under indicated humidities. Statistical analysis 
as in a. Bacterial populations indicated by different letters (a, b and c) 

are significantly different. n = 4 technical replicates; data are shown 

as mean + s.d. Experiments were repeated three times. Images are 
representative of leaves from at least four plants. 


Data Fig. 3c), and the avrE1~/hopM1~ mutant could multiply in atmin7 
mutant plant (Extended Data Fig. 3a, b). Thus, genetic removal of 
AtMIN7 is sufficient to mimic the virulence function of HopM1, albeit 
partially, in causing apoplast water soaking. The atmin7 mutant plant 
is defective in endocytic recycling of plasma membrane proteins and 
has an abnormal plasma membrane”, suggesting that HopM1 degrades 
AtMIN7, probably to compromise host plasma membrane integrity 
as a mechanism to create an infection-promoting aqueous apoplast 
(Extended Data Fig. 4). 

If apoplast water soaking is an essential step in pathogenesis, we 
hypothesized that plants may have evolved defence mechanisms to 
counter it. Indeed, we found that effector-triggered immunity (ETI)”” 
induced by Pst-DC3000(avrRpt2) blocked water soaking, even when 
the inoculum of Pst-DC3000(avrRpt2) was increased to reach a popula- 
tion similar to Pst-DC3000 when water soaking was assessed (Fig. 3b, c 
and Extended Data Fig. 5a, b). When transferred from high (approx- 
imately 95%) to low (around 50%) humidity, Pst-DC3000(avrRpt2)- 
infected leaves quickly wilted, indicating extensive ETI-associated 
programmed cell death. By contrast, Pst-DC3000-infected, water- 
soaked leaves returned to pre-infection healthy appearance (Fig. 3b), 
indicating little host cell death during apoplast water soaking. 
Furthermore, Pst-DC3000(avrRpt2)-triggered ETI stabilized the 
AtMIN7 protein’> (Fig. 3d). These results therefore show a previously 
unknown interaction between bacterial virulence (creating apoplast 
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water soaking) and host defence (preventing apoplast water soaking), 
which is linked in part to AtMIN7 stability, leading to modulation of 
apoplast water availability. 


Reconstitution of P. syringae infection 

As apoplast water soaking seems to be a key process in bacterial 
pathogenesis, we investigated a model in which suppression of 
pattern-triggered immunity (PTI) and creation of apoplast water 
soaking are two principal pathogenic processes sufficient for bac- 
terial infection of the phyllosphere to occur. We infected Col-0 
and two PTI-compromised triple mutant plants (fec and bbc) with 
DC3000478*, DC300047°§(avrPto) or DC300047**(hopM 1/shcM) 
and found that DC30004**(hopM 1/shcM), but not DC3000%7* or 
DC30004"*"(avrPto), caused strong water soaking, multiplied aggres- 
sively (almost to the level of Pst-DC3000) and produced prominent 
disease symptoms in the fec and bbc mutant plants (Fig. 4a—c) ina 
high humidity-dependent manner (Fig. 4d). Furthermore, unlike 
PTI mutants, the mpr1-6 mutant plant, which is defective in salicylic- 
acid-dependent defence (Extended Data Fig. 6a—c), did not show 
greatly increased DC300047**(hopM1/shcM) multiplication (Fig. 4a). 
Thus, a combination of defective PTI and the presence of an aqueous- 
apoplast-inducing effector (HopM1) could almost fully convert a 
non-pathogenic mutant into a virulent pathogen in the Arabidopsis 
phyllosphere. 

If immunosuppression and creation of apoplast water soaking are 
two principal pathogenic processes sufficient for bacterial infection 
of the phyllosphere, we hypothesized that it might be possible to con- 
struct a multi-host-target mutant that simulates the two processes. 
This mutant plant might allow an otherwise non-pathogenic mutant 
bacterium (for example, the hrcC™ mutant) to colonize the phyllosphere, 
thereby reconstituting basic features of a phyllosphere bacterial infec- 
tion. For this purpose, we mutated the AtMIN7 gene in PTI mutants 
(fec and bbc) and generated atmin7 fls2 efr cerk1 (mfec hereinafter) 
and atmin7 bak 1-5 bkk1-1 cerk1 (mbbc hereinafter) quadruple mutants 
using CRISPR-Cas9 technology (see Methods and Extended Data 
Fig. 7a). The quadruple mutant plants display a similar morphology to 
wild-type Col-0 plants (Extended Data Fig. 7b) and have a tendency 
to show some water-soaking spots, especially in mature leaves, under 
high humidity (Extended Data Fig. 7c, d). Notably, these mutants 
allowed the non-pathogenic hrcC™ mutant to multiply aggressively 
under high (approximately 95%) humidity, to a final population that 
was about 100-fold higher than in Col-0 plants 5 days after inocula- 
tion, with the mbbc plants showing a greater susceptibility than the 
mfec plants (Fig. 5a). In addition, in these quadruple mutant plants, 
the hrcC~ mutant induced prominent disease chlorosis and necrosis 
(Fig. 5b and Extended Data Fig. 7e), which were not observed for the 
hrcC~ strain in Col-0, atmin7 or PTI mutants. Thus, dual disruption of 
AtMIN7 and PTI signalling is sufficient to generate the basic features 
of a model phyllosphere bacterial disease. Consistent with these data, 
transient water supplementation to the leaf apoplast was sufficient to 
enhance the growth of the hrcC™ mutant in the bbc triple mutant, but 
not in Col-0 plants (Fig. 5c). To our knowledge, this is the first infec- 
tious model disease, in plant or animal, for which basic pathogenesis 
has been generated using biologically relevant host-target mutants. 


Dyshomeostasis of commensal bacteria 

The inability of the non-pathogenic hrcC” mutant to multiply aggres- 
sively in the wild-type phyllosphere resembles that of the commensal 
bacterial community that resides in the apoplast of healthy leaves. 
Consistent with this, only low levels of the endophytic phyllosphere 
bacterial community were detectable in wild-type Col-0 plants 
(Fig. 5d). However, after plants were shifted from regular growth 
conditions (around 60% relative humidity, day 0; Fig. 5d) to high humidity 
conditions (approximately 95% relative humidity), the mfec and mbbc 
quadruple mutant plants, but not Col-0 plants, showed excessive 
proliferation of the endogenous endophytic bacterial community 
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Figure 5 | Disease reconstitution experiments. a, b, hrcC” bacterial 
populations 5 days (a) and disease symptoms 10 days (b) after dip- 
inoculation in Col-0, fec, bbc, atmin7, mfec and mbbc plants grown in 
Redi-Earth soil. Humidity: ~95%. Statistical analysis was performed by 
one-way ANOVA with Tukey’s test (significance set at P< 0.05). Bacterial 
populations indicated by different letters (a, b, c and d) are significantly 
different (ad is not significantly different from a or d). n= 4 technical 
replicates; data are shown as mean + s.d. Experiments were repeated 

four times. c, hrcC” bacterial populations in Col-0 and bbc leaves 3 days 
after infiltration with bacteria at 1 x 10° cfu ml~!. The —H30 and 

+H,0 conditions are as in Fig. 2e. Statistical analysis was performed by 
one-way ANOVA with Tukey’s test (significance set at P< 0.05). Bacterial 
populations indicated by different letters (a, b and c) are significantly 
different (ab is not significantly different from a or b). n =3 technical 
replicates; data are shown as mean + s.d. Experiments were repeated 
three times. d, Col-0, fec, bbc, atmin7, mfec and mbbc plants grown in 
Arabadopsis Mix soil were mock-sprayed with H2O and kept under 

high humidity (~95%). On day 0 (before water spray) and day 5, total 
populations of the endophytic bacterial community were quantified by 
counting CFUs on R2A plates, after surface sterilization of leaves with 
75% ethanol, leaf homogenization and serial dilutions. FW, fresh weight. 
Statistical analysis as in a. Bacterial populations indicated by different 
letters (a and b) are significantly different. n = 4 technical replicates; data 
are shown as mean + s.d. Experiments were repeated three times. 

e, A proposed new model for Pst-DC3000 pathogenesis in Arabidopsis. 
Dashed arrows indicate a possible interaction, at spatial and temporal scales, 
between ‘immunosuppressior and a ‘wet apoplast’ during pathogenesis. 


(Fig. 5d and Extended Data Table 1), in a high-humidity-dependent 
manner (Extended Data Fig. 8a). Furthermore, the excessive prolifer- 
ation of the endophytic bacterial community was associated with mild 
tissue chlorosis and necrosis in some leaves (Extended Data Fig. 8b). 
This was noteworthy, because a recent study showed that overgrowth of 
a beneficial root-colonizing fungus in immunocompromised (against 
fungal pathogens) plants also led to harmful effects in Arabidopsis”*, 
illustrating a potentially common occurrence where the levels of 
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commensal and beneficial microbiota must be strictly controlled by the 
host for optimal plant health. It would be interesting in the future to 
conduct comprehensive in planta 16S rRNA amplicon-based analysis 
to determine whether there are also humidity-dependent changes in 
the composition of commensal bacterial communities in the Col-0, 
mfec and mbbc plants. 


Discussion 

Our data suggest a new conceptual framework for understanding 
phyllosphere-bacterial interactions (Fig. 5e). Specifically, we have iden- 
tified PTI signalling and AtMIN7, presumably via vesicle trafficking, 
as two key components of the elusive host barrier that functions to 
limit excessive and potentially harmful proliferation of non-pathogenic 
microbes (for example, the hrcC” mutant) in the phyllosphere. 
Pathogenic bacteria, such as Pst-DC3000, have evolved T3SS effec- 
tors not only to disarm PTI signalling but also to establish an aqueous 
living space in a humidity-dependent manner in order to aggressively 
colonize the phyllosphere. The conceptual framework presented here 
integrates host, pathogen and environmental factors, and provides a 
critical insight into the basis of the strong effect of humidity on the 
development of numerous bacterial diseases, consistent with the 
‘disease triangle’ dogma in plant pathology. 

Prior to this study, humidity was commonly thought to promote bac- 
terial movements on the plant surface and invasion into plant tissues. 
Our study, however, reveals a notable and previously unrecognized 
effect of high humidity on the function of bacterial effectors inside 
the plant apoplast. An aqueous apoplast could potentially facilitate the 
flow of nutrients to bacteria, promote the spread and/or egression of 
bacteria, and/or affect apoplastic-host-defence responses, the latter of 
which may underlie some of the previously observed effects of HopM1, 
AvrE1 and AtMIN7 on plant immunity*!??* and suggest a potential 
cross-talk between plant immune responses and water availability. 

Most of our current knowledge on plant-pathogen and plant- 
microbiome interactions are derived from studies under limited 
laboratory conditions. This study illustrates a need for future research 
to consider the dynamic climate conditions in which plants and 
microbes live in nature in order to uncover new biological phenomena 
involved in host-microbe interactions. Research that unravels the 
molecular bases of environmental influences of disease development 
should help us to understand the severity, emergence and/or disap- 
pearance of infectious diseases in crop fields and natural ecosystems, 
especially in light of the rapidly changing drought-humidity patterns 
associated with global climate change. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Plant materials and bacterial strains. Arabidopsis thaliana plants were grown in 
Arabidopsis Mix soil (equal parts of SUREMIX (Michigan Grower Products Inc.), 
medium vermiculate and perlite; autoclaved once) or Redi-Earth soil (Sun Gro 
Horticulture) in environmentally controlled growth chambers, with relative humid- 
ity at 60%, temperature at 22°C and a 12h light-12h dark cycle. Five-week-old 
plants were used for bacterial inoculation and disease assays. 

The bak1-5 bkk1-1 cerk1 mutant plant was generated by crossing the bak1-5 

bkk1-1 mutant" with the cerk1 mutant*’. PCR-based genotyping was performed in 
F, progeny to obtain a homozygous triple mutant. The npr1-6 (Fig. 4a) mutant was 
the SAIL_708_FO09 line ordered from the Arabidopsis Biological Resource Center, 
and confirmed to be a knockout mutant and defective in salicylic acid signalling 
(Extended Data Fig. 6). 
Bacterial disease assays. Syringe-infiltration and dip-inoculation were per- 
formed in this study. Briefly, Pst-DC3000 and mutant strains were cultured in 
modified Luria-Bertani medium (10g 1”! tryptone, 6g1~! yeast extract, 15g]! 
KH>POu, 0.6g 17! NaCl, and 0.4g 1-' MgSO4-7H20) containing 100mg I"! 
rifampicin (and/or other antibiotics if necessary) at 28°C to an ODgop of 0.8-1.0. 
Bacteria were collected by centrifugation and re-suspended in sterile water. 
Cell density was adjusted to OD 600 = 0.2 (approximately 1 x 108 cfu ml”). For 
syringe-infiltration, bacterial suspension was further diluted to cell densities 
of 1 x 10°-1 x 10° cfu ml~!. Unless stated otherwise, infiltrated plants were 
first kept under ambient humidity for 1-2h for water to evaporate, and after 
the plant leaves returned to pre-infiltration appearance, plants were kept under 
high humidity (approximately 95%; by covering plants with a clear plastic dome) 
or other specified humidity settings for disease to develop. For dip-inoculation, 
plants were dipped in the bacterial suspension of OD¢09 = 0.2, with 0.025% Silwet 
L-77 added, and then kept under high humidity (approximately 95%) immediately 
for disease to develop. 

Different humidity settings were achieved by placing a clear plastic dome over 
a flat (in which plants are grown) with different degrees of opening. A humidity- 
temperature Data Logger (Lascar) was placed inside the flat to record the humidity 
and/or temperature over the period of disease assay. 

For quantification of Pst-DC3000 bacterial populations, Arabidopsis leaves 
were surface-sterilized in 75% ethanol and rinsed in sterile water twice. Leaf disks 
were taken using a cork borer (9.5mm in diameter) and ground in sterile water. 
Colony-forming units were determined by serial dilutions and plating on Luria- 
Marine plates containing 100 mg 1“! rifampicin. One technical replicate consists 
of one or two leaf disks and 3 or 4 technical replicates (that is, from at least 3 or 4 
leaves) were included in each biological experiment. Experiments were repeated 
at least three times. 

CRISPR-Cas9-mediated mutation of the AtMIN7 gene. The one-plasmid 
CRISPR-Cas9 cloning system*! was used to mutate AtMIN7 into the fls2 efr 
cerk1 and bak1-5 bkk1-1 cerk1 plants. AtMIN7-single guide (sg)RNA primers 
containing target mutation regions were as follows, with AtMIN7 sequence 
underlined. 

AtMIN7-sgRNA-F, GATTGATCATTTGGAAGGGGATCC; AtMIN7-sgRNA-R, 
AAACGGATCCCCTTCCAAATGATC. 

The constructs containing AL(MIN7-sgRNA and Cas9 were cloned in pCAM- 
BIA1300, which were then mobilized into Agrobacterium tumefaciens for plant 
transformation. For genotyping of ALMIN7-mutated lines, total DNA was extracted 
from individual lines and the regions containing the CRISPR target sites were 
amplified by PCR using the following primers: 

AtMIN7-sgRNA-F2, GATGCTGCTTTGGATTGTCTTC; AtMIN7-sgRNA-R2, 
AATGGCTCCCCATGCACTGCGATA. 

For genotyping, the PCR products were digested with the BamHI restric- 
tion enzyme and plant lines showing an (partially or completely) uncut band 
were chosen. The PCR products of putative homozygous T lines, identified 
based on a lack of cutting by BamHI, were sequenced. The lines showing a 
frame-shift mutation and an absence of the Cas9 gene based on PCR using the 
following primers were identified as homozygous lines. The T3 and T, progeny 
of homozygous lines were used for disease assays. Primers for PCR-amplifying 
Cas9 gene were as follows:Cas9-F, CCAGCAAGAAATTCAAGGTGG; Cas9-R, 
GCACCAGCTGGATGAACAGCTT. 


Imaging of bacterial colonization with luciferase assay. Four-week-old 
Arabidopsis Col-0 plants were dip-inoculated with the Pst-DC3000 or Pst-DC3000- 
lux strain”. The infected plants were fully covered with a plastic dome to maintain 
high humidity. Leaves were excised from the infected plants 2 days after inocula- 
tion and the light signals were captured by a charge-coupled device (CCD) using 
ChemiDoc™ MP system (Bio-Rad). 

AtMIN7 protein blot. Arabidopsis leaves were syringe-infiltrated with bacteria or 
H,O and kept under high humidity (approximately 95%) for 24h. Leaf disks were 
homogenized in 2 SDS buffer, boiled for 5 min and centrifuged at 10,000g for 
1 min. Supernatants containing the total protein extracts were subjected to separa- 
tion by SDS-polyacrylamide gel electrophoresis (PAGE). An AtMIN7 antibody” 
was used for western blotting to detect the AtMIN7 protein. Uncropped blot and 
gel images are included in Supplementary Fig. 1. 

Bacterial community quantification. Five-week-old plants were sprayed with 
H,O and covered with a plastic dome to keep high humidity (approximately 95%) 
for 5 days. To quantify the endophytic bacterial community, leaves were detached, 
sterilized in 75% ethanol for 1 min (Extended Data Fig. 9) and rinsed in sterile 
water twice. Leaves were weighed and ground in sterile water using a TissueLyser 
(Qiagen; at the frequency of 30 times per second for 1 min) in the presence of 3-mm 
Zirconium-oxide grinding beads (Glen Mills; 5 beads in each tube). After serial 
dilutions, bacterial suspensions were plated on R2A plates (Teknova), which were 
kept at 22°C for 4 days before colonies were counted. Colony-forming units were 
normalized to tissue fresh weight. 

16S rRNA amplicon sequence analysis of the endophytic bacterial community. 
The Col-0, mfec and mbbc plants were sprayed with water and kept under high 
humidity (approximately 95%) for 5 days. Leaves were surface-sterilized in 75% 
ethanol for 1 min and rinsed in sterile water twice. Leaves from four plants were 
randomly selected (2 leaves from each plants; 8 leaves in total) and were divided 
into 4 tubes (2 leaves in each tube) and ground in sterile water. Bacterial sus- 
pensions were diluted (Col-0 samples were diluted to 10-7 and mfec and mbbc 
samples were diluted to 10~°) and, for each genotype, 1511 suspension from each 
tube of the right dilution (107? dilution for Col-0 and 10~° for mfec and mbbc) 
were pooled together and plated on R2A plates, which were kept at 22°C for 
4 days. Fifty colonies from each genotype were randomly picked, genomic DNA 
was extracted and PCR was performed with AccuPrime high-fidelity Taq DNA 
polymerase (Invitrogen) and primers 799F/1392R (ref. 32) to amplify bacterial 16S 
rRNA gene. The PCR product was sequenced and taxonomy of each bacterium 
(family level) was determined by Ribosomal Database Project at Michigan State 
University (https://rdp.cme.msu.edu/)**. 

Data analysis, statistics and experimental repeats. The experiments were not ran- 
domized and the investigators were not blinded to allocation during experiments 
and outcome assessment. The specific statistical method used, the sample size and 
the results of statistical analyses are described in the relevant figure legends. Sample 
size was determined based on experimental trials and with consideration of previ- 
ous publications on similar experiments to allow for confident statistical analyses. 
The Student's two-tailed t-test was performed for comparison of means between 
two data points. A one-way or two-way ANOVA with Tukey’s test was used for 
multiple comparisons within a data set, with significance set to a P value < 0.05. 
ANOVA analysis was performed with GraphPad Prism software. 

Data Availability. The bacterial 16S rRNA sequences in Extended Data Table 1 
have been deposited in the National Center for Biotechnology Information (NCBI) 
GenBank database under accession numbers KX959313-KX959462. Other data 
that support the findings of this study are available from the corresponding author 
upon request. 
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Extended Data Figure 1 | Water soaking does not affect luminescence signal. Col-0 plants were dip-inoculated with bacteria at 2 x 10° cfu ml! and 
kept under high humidity (approximately 95%) for 2 days. Imaging was performed in the same way as in Fig. 1g. Water-soaked leaves were air-dried for 
about 2h and imaged again (right panel). Images are representative of leaves from more than four plants. 
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Extended Data Figure 2 | Humidity dependence of HopM1/AvrE1 were taken 3 days after inoculation. Images are representative of leaves 
function and restoration of the virulence of the avrE1~ /hopM1— from more than four plants. c, d, The 6x His: HopM1 transgenic plants” 
mutant in 6 x His: HopM1 transgenic plants. a, b, The virulence of were infiltrated with 0.1 nM dexamesathone (DEX), the avrE1~/hopM1- 
the avrE1-/hopM1~ mutant is insensitive to humidity settings. a, Col-0 mutant (at 1 x 10° cfu ml~’) or both. H2O was infiltrated as control. 
plants were syringe-infiltrated with indicated bacteria at 2 x 10° cfu ml“!. Infiltrated plants were kept at high humidity (approximately 95%). Leaf 
Inoculated plants were kept under high (approximately 95%) humidity, images were taken 24h after infiltration (c) and bacterial populations 
and images were taken 24h after infiltration. b, Col-0 plants were syringe- were determined 3 days after infiltration (d). Significant difference was 
infiltrated with Pst-DC3000, the avrE1~ mutant, the hopM1~ mutant or determined by Student's f-test; (two-tailed); ***P = 1.03 x 10->.n=6 
the avrEI~/hopM1~ mutant at 2 x 10° cfu ml”. Inoculated plants were technical replicates from three independent experiments (n =2 in each 
kept under high (approximately 95%) or low (20-40%) humidity. Images experiment); data are shown as mean + s.d. 
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Extended Data Figure 3 | Bacterial multiplication and water soaking 
in Col-0 and the atmin7 mutant. a, The Col-0 and atmin7 plants were 
dip-inoculated with Pst-DC3000, the avrE1/hopM1~ mutant or the hrcC™ 
mutant at 1 x 10° cfu ml~!. Bacterial populations were determined 4 days 
after inoculation. Significant difference between Col-0 and atmin7 plants 
was determined by Student's t-test (two-tailed); *P = 1.61 x 10-7 and 
3.12 x 10°? for DC3000 and hrcC~, respectively; ***P = 1.41 x 10~4 for 
avrE1 /hopM1-.n=4 technical replicates; data are shown as mean + s.d. 
Experiments were repeated three times. b, c, The Col-0 and atmin7 plants 
were syringe-infiltrated with Pst-DC3000, the avrE1~/hopM1~ mutant 
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or the hrcC~ mutant at 1 x 10° cfu ml“ !. Bacterial populations were 
determined 3 days after inoculation (b) and leaf images were taken 38h 
after infiltration with the averE-/hopM1~ mutant strain to show water 
soaking in atmin7 leaves (c). Significant difference between Col-0 

and atmin7 plants was determined by Student’s t-test (two-tailed); 

** P— 1.63 x 10°? for avrE1 /hopM1 ; NS, not significant (P= 0.72 and 
0.14 for DC3000 and hrcC’, respectively). n =3 technical replicates; data 
are shown as mean + s.d. Experiments were repeated three times. Images 
are representative of leaves from more than four plants. 
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Extended Data Figure 4 | A working model showing the function of HopM1 and AvrE1, create an aqueous apoplast in the presence of bacterial 
HopM1 and AvrE1 in creating aqueous apoplasts. Pst-DC3000 delivers infection in a humidity-dependent manner. AvrE1 is localized to the host 
a total of 36 effectors into the plant cell. Many effectors, including AvrPto, plasma membrane”*. HopM1 targets AtMIN7 (an ARF-GEF protein) 
appear to suppress pattern-triggered immunity (PTI). AvrPto inhibits in the TGN/EE, which is involved in recycling of plasma membrane 
pattern recognition receptor (PRR) function®. Two conserved effectors, proteins”®, 
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Extended Data Figure 5 | Water-soaking is blocked during ETI. exposure (a) and bacterial populations were determined 24h after 

a, Col-0 leaves were syringe-infiltrated with Pst-DC3000 (1 x 10° cfu ml~!) __ infiltration to show similar population levels (b). Significant difference 
or Pst-DC3000(avrRpt2) (1 x 10’ cfu ml~'). Plants were kept under high in bacterial population was determined by Student's t-test (two-tailed); 
humidity (approximately 95%) for 24h to observe water soaking and *P—0.033. n= 3 technical replicates; data are shown as mean + s.d. 

then shifted to low humidity (approximately 25%) for 2h to observe ETI- Experiments were repeated three times. This is an experimental replicate 


associated tissue collapse. Images were taken before and after low humidity of Fig. 3b, c (without rps2 mutant plants). 
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Extended Data Figure 6 | Characterization of the npr1-6 mutant. 

a, A diagram showing the T-DNA insertion site in the npr1-6 mutant. 

Blue boxes indicate exons in the NPRI gene. b, RT-PCR results showing that the 
npr1-6 line cannot produce the full-length NPRI1 transcript. Primers used (NPR 
sequence is underlined): NPR1-F: agaattcATGGACACCACCATTGATGGA; 


NPRI-R: agtcgacCCGACGACGATGAGAGARTTTAC; UBC21-F: TCAAATG 
GACCGCTCTTATC; UBC21-R: TCAAATGGACCGCTCTTATC. Uncropped 
gel images are shown in Supplementary Fig. 1. c, The npr1-6 line, similar to 


npr1-1, is greatly compromised in benzothiadiazole (BTH)-mediated resistance 
to Pst-DC3000 infection. The Col-0, mpr1-1 and npr1-6 plants were sprayed 
with 100).M BTH and dip-inoculated with Pst-DC3000 at 1 x 108 cfu ml~! 24h 
later. Bacterial populations were determined 3 days after inoculation. Significant 
difference between mock and BTH treatment was determined by Student's t-test 
(two-tailed); *P =0.027; ***P=1.6 x 10-4; NS, not significant (P=0.19). n=3 
technical replicates; data are shown as mean + s.d. Experiments were repeated 
three times. 
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Extended Data Figure 7 | Construction and characterization of the mfec 
and mbbc quadruple mutants. a, CRISPR-Cas9-mediated mutations in 
the 4" exon of the AtMIN7 gene (exons indicated by blue boxes) in the 
quadruple mutant lines used in this study. The underlined sequence in the 
wild type (WT) indicates the region targeted by sgRNA. The number 399 
indicates the nucleotide position in the AtMIN7 coding sequence. +1 and 
—1 indicate frame shifts in the mutant lines. b, Col-0 and various mutants 
used in this study have similar growth, development and morphology. 
Four-week-old plants are shown. c, The mfec and mbbc plants show a 
tendency to develop sporadic water soaking under high humidity. 
Five-week-old regularly-grown (around 60% relative humidity) Col-0, 
mfec and mbbc plants were shifted to high humidity (approximately 95%) 
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overnight and images of mature leaves were taken after the high humidity 
incubation. d, Even leaves of mfec and mbbc plants that do not have 
sporadic water soaking have a tendency to develop some water soaking 
after hrcC” inoculation. Five-week-old Col-0, mfec and mbbc plants 
were dip-inoculated with hrcC~ at 1 x 10° cfu ml“, and kept under 
high humidity (approximately 95%). Leaf images were taken 2 days after 
inoculation. Images are representative of leaves from at least four plants. 
e, The non-pathogenic hrcC™ mutant causes significant necrosis and 
chlorosis in the quadruple mutant plants. Col-0, mfec and mbbc plants 
were dip-inoculated with the hrcC~ strain at 1 x 10° cfu ml~'. Images 
were taken 9 days after inoculation. This is one of the four independent 
experimental replications of the results presented in Fig. 5b. 
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Extended Data Figure 8 | Multiplication of endophytic nee 
bacterial community. a, An increase in the endophytic bacterial 
community in mfec and mbbc plants depends on high humidity. Col-0, 
mfec and mbbc plants were either sprayed with H2O and kept under high 
humidity (approximately 95%) or low humidity (around 50%). On day 5, 


total populations of the endophytic bacterial community were quantified. 


Statistical analysis was performed by one-way ANOVA with Tukey’s test 
(significance set at P< 0.05). Bacterial populations indicated by different 


atmin? mfec mbbc 


letters (a and b) are significantly different. n = 4 technical replicates; data 
are shown as mean + s.d. Experiments were repeated three times. b, Mild 
chlorosis and necrosis in leaves is associated with increased endophytic 
bacterial community level in the mfec and mbbc quadruple mutant plants. 
Plants were sprayed with H,O and kept under high (approximately 95%) 
humidity. Images were taken 10 days after spraying. Individual leaves 

are enlarged and shown in the lower panel, showing mild chlorosis and 
necrosis in some of the mfec and mbbc leaves. 
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Extended Data Bad 9| oe of 1 min as an effective surface 
sterilization time. Five-week-old Col-0 plants were sprayed with H2O and 
kept under high humidity (approximately 95%) for 5 days. Leaves were 
detached, surface sterilized in 75% ethanol for 20s, 40s, 1 min or 2min 
and then rinsed in sterile water twice. No sterilization (0s) was used as 
control. Leaves were ground in sterile water and bacterial numbers were 
determined by serial dilutions and counting of colony-forming units 

on R2A plates. Statistical analysis was performed by one-way ANOVA 
with a Tukey’s test (significance set at P< 0.05). Bacterial populations 
indicated by different letters (that is, a and b) are significantly different. 
n=4 technical replicates; data are shown as mean + s.d. Experiments were 
repeated twice with similar results. 
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Extended Data Table 1 | Endophytic bacterial taxa in Col-0, mfec and mbbc plants 


Order/ Family 


Bacillales 
Paenibacillaceae 
Burkholderiales 


Comamonadaceae 
Burkholderiaceae 
Alcaligenaceae 


Flavobacteriales 
Flavobacteriaceae 


Xanthomonadales 


Xanthomonadaceae 
Sphingomonadales 


Sphingomonadaceae 
Sphingobacteriales 


Sphingobacteriaceae 
Chitinophagaceae 


Rhizobiales 
Rhizobiaceae 
Cytophagales 
Cytophagaceae 
Pseudomonadales 


Pseudomonadaceae 
Actinomycetales 


Microbacteriaceae 


Col-0 


15 (30%) 


8 (16%) 
4 (8%) 
3 (6%) 


6 (12%) 
4 (8%) 


3 (6%) 


3 (6%) 
1 (2%) 


2 (4%) 


1 (2%) 


ND 


ND 


ND, not detected. See Methods for 16S rRNA amplicon sequencing procedures. 


mfec 


ND 


12 (24%) 
1 (2%) 
19 (38%) 


1 (2%) 
9 (18%) 


ND 


ND 
ND 


5 (10%) 


ND 


1 (2%) 


2 (4%) 


mbbc 


ND 


9 (18%) 
22 (44%) 
12 (24%) 


1 (2%) 
ND 


1 (2%) 


ND 
ND 


ND 


ND 


5 (10%) 


ND 
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The mechanism of force transmission at 
bacterial focal adhesion complexes 


Laura M. Faure!*, Jean-Bernard Fiche?*, Leon Espinosa!*, Adrien Ducret!*, Vivek Anantharaman‘, Jennifer Luciano!, 
Sébastien Lhospice', Salim T. Islam!, Julie Tréguier!, Mélanie Sotes!, Erkin Kuru®, Michael S. Van Nieuwenhze’, Yves V. Brun’, 
Olivier Théodoly’, L. Aravind*, Marcelo Nollmann? & Tam Mignot! 


Various rod-shaped bacteria mysteriously glide on surfaces in the absence of appendages such as flagella or pili. In the 
deltaproteobacterium Myxococcus xanthus, a putative gliding motility machinery (the Agl-Glt complex) localizes to 
so-called focal adhesion sites (FASs) that form stationary contact points with the underlying surface. Here we show that 
the Agl-Glt machinery contains an inner-membrane motor complex that moves intracellularly along a right-handed 
helical path; when the machinery becomes stationary at FASs, the motor complex powers a left-handed rotation of the 
cell around its long axis. At FASs, force transmission requires cyclic interactions between the molecular motor and the 
adhesion proteins of the outer membrane via a periplasmic interaction platform, which presumably involves contractile 
activity of motor components and possible interactions with peptidoglycan. Our results provide a molecular model of 


bacterial gliding motility. 


Certain rod-shape bacteria move along their long axes in the absence 
of extracellular appendages such as flagella or pili, in a process called 
gliding motility’. In Myxococcus xanthus, gliding is mediated by 
bacterial FASs. During gliding motility, FASs assemble at the lead- 
ing cell pole and retain a fixed position relative to the surface until 
they disassemble at the lagging cell pole’. FASs contain a molecular 
machinery, the Ag]-Glt (adventurous gliding and gliding transducer) 
complex. which comprises more than fourteen proteins**. Genetic 
analysis has suggested that this machinery is formed by two 
membrane-associated systems: (i) a putative three-protein TolQR- 
like proton motive force (PMF)-driven channel comprising AglR, 
AglQ and AglS (the suspected energy-producing system)*>®; and 
(ii) a putative eleven-protein integral envelope-associated complex 
(containing GltA-K) that interacts with the Agl system?*. These 
machineries are further connected to a cytosolic protein complex 
formed by the AglZ protein, the Ras-like G-protein MglA and the 
MreB actin cytoskeleton (hereafter called the MreB complex’; Fig. 1a). 
This MreB complex recruits, and promotes the assembly of the Agl- 
Glt complex (Fig. 1a). Directional movements of the Agl-Glt complex 
from the leading towards the lagging cell pole have been suggested to 
propel the cell forward*’, but how these movements may be trans- 
duced into cell movement remains unknown>”. Here, we reveal the 
functional architecture of the Ag]-Glt complex and establish how its 
activity is transduced to the contact surface across the highly struc- 
tured layers of the cell envelope. 


Intracellular helically moving engines drive propulsion 

We quantitatively characterized the dynamic behaviour of FASs by 
analysing the motions of Ag]Z-YFP (yellow fluorescent protein)- 
containing complexes using total internal reflection fluorescence 
microscopy (TIRFM). Cells attached to a chitosan-coated surface alter- 
nated between motile and non-motile states. In these conditions, we 
were able to capture the movement of AgIZ-YFP clusters over extended 


periods of time with high temporal resolution. Two main AglZ-YFP 
cluster populations were observed: static and dynamic (Fig. 1b; blue 
and orange, respectively). Motile cells on chitosan exhibited at least 
one static AgIZ-YFP cluster, indicating that a single static cluster is 
necessary and sufficient for cell propulsion. We also observed dynamic 
AglZ-YFP clusters. These clusters tended to form at one cell pole and 
migrate directionally towards the opposite pole (Fig. 1b and Extended 
Data Fig. 1a). On average, clusters formed every minute and moved 
at a constant velocity (3.2+0.9,1m min |, 1 =227) over distances of 
1.5+1 1m (n=203), often becoming dissociated upon reaching the 
opposite pole. Cluster speeds varied within and between cells (Extended 
Data Fig. 1b), possibly because of variations in the number of motor 
units in a cluster (see below) and changes in PMF levels between cells*®. 
Dynamic clusters are likely to represent unattached motility complexes 
because: (i) in motile cells, they were detected only ifa fixed cluster was 
also present; and (ii) in most cells the transition from a non-motile to 
a motile state coincided with cluster immobilization (>85%, n = 34; 
Fig. 1b, orange-blue cluster). To characterize this behaviour quanti- 
tatively, we measured the correlation (3) between cell movement and 
the presence of static and dynamic clusters (Fig. 1b, lower panel). 
Importantly, the presence of static, but not dynamic, clusters was highly 
correlated with cell movement (Fig. 1c, n=95). 

Close examination of dynamic clusters by TIRF revealed that they 
not only move between poles but also move across the cell width 
following a helical path (Fig. 1d). The helicity, characterized by the 
angle of a helix projected on a plane (y,, Fig. le), was constant between 
cells (a= 78° + 5°, n= 54; Fig. le). In most cases (92%, n = 54), the 
direction of rotation of AgIZ-YFP clusters was counterclockwise 
relative to the direction of movement, denoting a right-handed 
helical path. Treating the cells with A22, a drug that inhibits MreB 
polymerization’®, decreased the number of dynamic clusters per cell 
(Extended Data Fig. 1c, d) but, notably, not their helical movement or 
directionality (Extended Data Fig. le). 
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Figure 1 | The Myxococcus motility complex moves directionally along 
a helical path. a, Spatial regulation of the Myxococcus motility complex. 
A motile AgIZ-YFP-expressing cell showing a complete assembly cycle 

is shown at 30-s intervals. A current view of an Agl (blue)-Glt (yellow) 
complex at an FAS is shown”””, Assembly occurs at the leading cell pole 
following interactions between MglA-GTP, MreB and AglZ (right panel, 
blue®). The position of the Glt proteins is drawn based on published 
works**”°. b, Immobilization of AgIZ-YFP clusters correlates with 

cell movement. TIRFM of AglZ-YFP in a cell that shifts to motility on a 
chitosan-treated surface. Images were acquired every 0.5 s. Selected time 
frames and the corresponding high-resolution kymograph are shown. 
Two dynamic (orange) clusters are shown. Note that cell movement 
(indicated by the dashed line showing the initial cell position) is observed 


Moving cells rotate in a clockwise direction 

We reasoned that if propulsion was linked to the counterclockwise 
trafficking of AgIZ~YFP-containing motility complexes, then a glid- 
ing cell body should rotate along a similar helical path but of opposite 
handedness (that is, clockwise; Fig. 2a). To test this idea, we followed 
the dynamics of fiducial markers (artificial fluorescent D-amino acids, 
or TADA) fixed to the cell periphery during cell movement (Extended 
Data Fig. 2a—d). In motile cells, TADA clusters moved from one side 
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Figure 2 | Myxococcus cells rotate during motility. a, The helical 
movement of intracellular motors predicts rotation of the cell during 
movement. b, Rotation of a TADA-bright cluster in a motile cell (30-s 
intervals) and in a sum-type projection (right). The cartoon representation 
shows the position of the centroid of the cluster relative to the cell outline. 
c, Three-dimensional positions of a TADA-bright cluster measured by 
astigmatism (orange dots) illustrating the clockwise rotation of the motile 
cell (1 = 8; five biological replicates). Snapshots of the cell are displayed, 
illustrating the deformation of the microscope point spread function 

as a function of the axial position. d, Rotation angle of TADA-bright 
clusters ((r) as a function of cell velocity from astigmatism (filled circles, 
83.9° + 2.5°; n= 8, 5 biological replicates) and intensity variations 

(open circles 82.0° + 2.6°; n= 17, three technical replicates). 


only when a cluster becomes stationary (see orange—blue cluster). Scale 
bar, 2 um. Lower panel: calculation of the correlation coefficient (R) 
between the presence of a cluster and cell movement. The fixed cluster 
(blue) is highly correlated with cell movement (/3= 1), while the dynamic 
cluster (orange) is partially anti-correlated (= —0.3). c, Distribution of 
R for fixed (blue) and dynamic clusters (orange), n = 95 (six biological 
replicates). d, AgIZ-YFP clusters move along helical trajectories. TIRFM 
at selected time frames of a dynamic AglZ-YFP cluster in a non-motile 
cell are shown. Scale bar, 2j1m. (e) Measurement of the trajectory angle 
(Ya) from n= 54 (eight biological replicates) single trajectories of dynamic 
AglZ-YFP clusters (top panels). Histogram of p, and Gaussian fit (grey 
line) are shown. The mean angle is shown with a dashed vertical line and 
corresponds to counterclockwise trajectories. 


of the cell to the opposite side with angular velocities proportional to 
the speed of motility (Fig. 2b and Extended Data Fig. 2e), consistent 
with TADA clusters reporting on the overall rigid-body rotational 
movement of the cell during propulsion. 

To determine the direction of rotation, we directly tracked the 3D 
positions of TADA clusters during cell movement by inducing astig- 
matism into the optical detection path"! (see Methods). In agreement 
with our predictions, TADA clusters rotated in the clockwise direction 
during cell propulsion (Fig. 2c and Extended Data Fig. 2f; n = 8). These 
observations were confirmed by monitoring the changes in fluores- 
cence intensity produced when TADA clusters moved in and out of the 
imaging plane during cell movement (see Methods and Extended Data 
Fig. 2g;n =17). The rotation angle for TADA clusters (yr) was constant 
between cells (82.6° + 2.7°, n= 25, as measured by both methods; 
Fig. 2d) and closely matched the angle of rotation (y,) measured for 
AglZ-YFP clusters (78° + 5°). Interestingly, yy did not vary signifi- 
cantly with cell speed (Fig. 2d). Together, these findings indicate that 
anchoring of dynamic AglZ-YFP-containing complexes leads to the 
clockwise rotation and forward propulsion of the cell. 


Functional analysis of the Agl-Glt motility complex 
To gain further insight into the molecular mechanism of motility, we 
genetically dissected the functional groups composing the Agl-Glt 
machinery (Fig. la). Notably, mutations in ag/ and git genes led to 
aberrant localization patterns and perturbation of AgIZ—YFP cluster 
dynamics (Fig. 3a and Extended Data Fig. 3a). The global effect of each 
mutation on the localization pattern of AgIZ-YFP was evaluated by 
measuring four observables: the proportion of cells with AgIZ-YFP 
clusters, the mean number of clusters, and the longitudinal and radial 
distributions of clusters for wild-type and mutant cells (Fig. 3a and 
Extended Data Fig. 3b). Principal component analysis (PCA) was used 
to quantitatively characterize the effect of each mutant in the assembly 
of AglZ-YFP clusters (see Methods). This statistical method allowed 
us to convert the set of correlated observables into a set of linearly 
uncorrelated principal components (PCs). 

The first three principal components described >87% of the variance 
(Extended Data Fig. 4a—d). PC1 represented a linear combination of 
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the number of AglZ-YFP clusters, longitudinal cluster position, and 
proportion of cells with clusters. PC2 mostly represented the average 
number of clusters detected per cell, while PC3 was essentially domi- 
nated by the mean radial cluster position (Extended Data Fig. 4a-d). 
A synthetic representation of results can be obtained by plotting the 
mean position of each mutant in a principal component space where 
the wild type is arbitrarily positioned at the origin (Fig. 3b; for clarity 
only PC1 and PC2 are shown but the analysis is computed from PC1-3; 
Extended Data Fig. 4e). Therefore, for each mutant, the distance to 
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Figure 3 | Functional architecture of the Agl-Glt machinery. a, The 
localization of AgIZ-YFP is affected to different extents in agl-glt 

mutants. The number and position (longitudinal and radial) of clusters 
was determined in each mutant background and their effect analysed 

by PCA (see Methods). Scale bar, 1 j.m. b-d, The effect of mutations in 
assembly dynamics of Ag1Z-YFP, GItI-YFP and AglQ-mCherry plotted 

in PC coordinates. Triangles represent inner membrane (IM) group 
proteins, squares represent motor components and circles represent other 
components (periplasm-outer membrane (OM)). Right panels show 
schematic representations of the projections of the defined gene groups 
with respect to their predicted/shown localization from? *”°. Colour code 
represents distance to the wild type (WT) in PC space. Letters represent 
each protein subunit of the Agl-Glt complex. e, The compilation of PCA 
(b-d) superimposed to available data” *””° suggests that the motility 
machinery consists of a molecular motor (dark blue), an inner membrane- 
cytosolic group (light blue) and a large periplasm-outer membrane group 
(orange). 


the origin and the direction represent the relative effect of a mutation 
on the number and distribution of AgIZ-YFP clusters. Mutations in 
predicted inner membrane components (GItI, G, J) produced the 
largest perturbations in the assembly dynamics of AglZ-YFP clusters, 
whereas mutations in periplasmic outer-membrane group proteins 
(GItA, C, D, E, H, K) displayed the lowest effect (Fig. 3b). Interestingly, 
mutations in AglQ (motor) affected AgIZ-YFP cluster formation in a 
qualitatively different manner, represented by a direction in PC space 
orthogonal to that observed for other mutants. 

We refined the functional connections between agl and git genes by 
further investigating the effect of gene deletions on the formation of 
GItI-YFP’ and AglQ-mCherry*” clusters (GltI and AglQ are putative 
cytosolic and motor components of the gliding machine, respectively). 
The overall effects of gene deletions on the number and distribution 
of GltI-YFP clusters were similar to those observed for AglZ-YFP 
clusters, consistent with GltI and AgIZ belonging to the same functional 
group (Fig. 3c). Mutations in GltJ and GltG, two predicted inner 
membrane proteins” that might be in direct contact with GItl, had a 
large effect on GltI- YFP assembly (Fig. 3c). Interestingly, mutations 
in GltD, GItH and GItK had a larger effect than did mutations in other 
proteins in their subgroup (for example, GItA, B, C, E), suggesting that 
direct protein-protein interactions between these proteins and factors 
of the GltI subgroup may be needed for GItI-YFP localization (Fig. 3c). 
Finally, we analysed the effect of deletions in the assembly of motor 
components (AglQ). Strikingly, deletion of factors in all subgroups led 
to severe perturbations in the formation of Ag\Q-mCherry fluorescent 
clusters (Fig. 3d), suggesting that the motor may require several 
contacts with different Glt proteins to form functional clusters. 
However, specific components exhibited differential roles: deletion of 
GltJ,GltD and GItK had the largest effects, whereas deletion of GItB 
barely affected the formation of AgLQ-mCherry clusters (Fig. 3d). 
Overall, these data indicate that the motility complex is divided into 
several distinct functional groups (Fig. 3e). 


Motor docks to OM complex during propulsion 

To determine the sequence of events leading to the assembly of 
propulsive complexes, we imaged the dynamic localization of Ag]Z- 
YFP (cytosolic-inner membrane group) simultaneously with that of 
proteins belonging to each of the other functional subgroups: AglQ 
(motor), GltD (periplasmic), or GltC-mCherry (periplasm-outer 
membrane complex) (Fig. 4a, c, e). Under conventional epifluores- 
cence microscopy, cytosolic (AglZ) and motor components (AglQ) 
in motile cells (on 1.5% agar) displayed a strong ability to form 
clusters, in contrast to the dispersed localization of periplasmic and 
outer membrane complex components (GItC, D) to the cell periphery 
(Extended Data Fig. 5a). Thus, we used TIRFM to image the dynamic 
localization of these factors in motile cells adhered to chitosan-coated 
glass. AglQ—-mCherry and AglZ-YFP co-localized in both stationary 
and mobile clusters, indicating that these proteins form a stable 
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Figure 4 | Dynamic interactions between the intracellular and outer 
membrane complexes generate propulsion. a, Simultaneous dynamics 
of AgIZ-YFP and AglQ-mCherry and correlation to the motility phases. 
TIREM time-lapse images and corresponding kymographs are shown for 
individual fluorescent fusions and in a heat map showing the computed 
co-localization scores. b, AgIZ and AglQ co-localize in the inner 
membrane trafficking complex (n = 156 cells; two biological replicates). 
Correlation scores reflect the correlation between cell movement and 
static co-localized clusters. c, d, Simultaneous dynamics of AgIZ-YFP 
and GltD-mCherry and correlation to the motility phase (n = 121 cells; 
seven biological replicates). Legend as in a, b. e, f, Simultaneous dynamics 
of AgIZ-YFP and GltC-mCherry and correlation to the motility phase 


complex in the bacterial inner membrane (Fig. 4a; see also Methods 
and Extended Data Figs 5b, c, 6). To investigate whether clusters con- 
taining both AglZ and AglQ were linked to cell motility, we calculated 
for each cell the correlation between cell movement and co-localization 
and computed the distributions for both stationary and mobile clusters 
(Fig. 4b; see also Methods and Extended Data Fig. 7). The presence of 
stationary clusters was highly correlated to cell movement, whereas that 
of mobile clusters was not, consistent with AglZ and AglQ being part of 
the trafficking internal complex (Fig. 4b). By contrast, GltD-mCherry 
and GltC-mCherry co-localized with AgIZ-YFP only in stationary 
clusters (Fig. 4c, e and Extended Data Fig. 5b, c) and co-localization of 
stationary AglZ-GltC/D clusters was highly correlated to cell movement 
(Fig. 4d, f). These data strongly suggest that cytosolic-inner membrane 
and motor complexes assemble together in a mobile unit that requires a 
physical connection to periplasmic and outer membrane components 
to form stationary clusters that impart cell movement. 

To investigate this hypothesis further, we determined the correla- 
tion between cell speed and the proportion of clusters containing both 
AglZ and AglQ, GltC or GltD (Fig. 4g). Dynamic clusters containing 
AglZ and AglQ were observed in motile cells, but their number was 
independent of cell speed, indicating that they are not propulsive by 
themselves (Fig. 4g). On the contrary, the proportion of AgIZ-AglQ 
clusters that were stationary increased rapidly with cell speed (Fig. 4g). 
The number of AglZ—AglQ clusters reached a maximum at ~50% of 


Normalized speed 


Normalized speed Normalized speed 


(n= 100 cells; three biological replicates). Legend as in a, b. g, Correlation 
between cell velocity and number of active complexes. For a given value 
of the cell speed (V), the percentage of co-localized clusters represents the 
proportion of events in which a cell was moving at speed V and an active 
cluster containing both AglZ and AglQ, GltD or GItC was detected. The 
percentage of co-localization is shown as a function of normalized cell 
speed (where 1 is the maximum cell speed). h, Correlation between cell 
velocity and intensity of AglZ and AglQ clusters. Intensity was normalized 
with respect to the average intensity measured for dynamic clusters and 
shown as a function of normalized cell speed. Error bars represent s.e.m. 
calculated according to the total number of cells analysed (n = 114 cells for 
AglZ, n= 100 for AglQ). 


the cell maximum speed (V;,), suggesting that this number is not a 
limiting factor (Fig. 4g). In fact, the recruitment of GltD and GltC to 
FASs could be a limiting step because a substantial number of static 
AglZ clusters lack GltD and, especially, GltC (Extended Data Fig. 5b, c). 
Consistent with this finding, the number of stationary AgIZ-GltD and 
AglZ-GItC clusters was only ~60% of the maximum at Vy, and full speed 
was reached only when the percentage of GltD-GItC stationary clusters 
recruited to FASs saturated (Fig. 4g). Thus, the recruitment of GltD and 
GltC to FASs is required for propulsion, probably because these proteins 
belong to complexes that link the motor to the external surface. 

To test further whether the stoichiometry of Agl-Glt components 
regulates the activity of FASs, we measured how the mean fluorescence 
intensity of stationary and mobile clusters changed with normalized 
cell speed (Fig. 4h and see Methods). Cell speed increased with the 
number of AglZ and AglQ subunits accumulating in stationary clusters 
(Fig. 4h). By contrast, the proportion of these subunits in mobile 
clusters was systematically lower and uncorrelated with cell speed 
(Fig. 4h). All in all, these results show that stationary focal adhesion 
complexes form as a result of the transient recruitment of periplasmic 
and outer membrane proteins by the mobile PMF-driven inner 
membrane complex. Importantly, the activity of stationary complexes 
is regulated at two levels: by the number of Ag] motor units (force 
generation) and by the number of local contacts with periplasmic-outer 
membrane components (transmission). 
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Figure 5 | Cyclic contacts between the inner membrane motor and outer 
membrane adhesins drive propulsion. a, GltC-mCherry is released by 
gliding cells at focal adhesion complexes. Top, TIRFM snapshots for a 
representative cell expressing both GltC-mCherry (red) and AgIZ-YFP 
(green). The positions of the GltC clusters on the surface coincide with 

the positions of focal adhesion complexes (white asterisk). Time frames, 
15s. Scale bar, 21m. Bottom, variation in intensity of GltC-mcherry (red) 
and AglZ-YFP (green) as a function of time before (negative time) and 
after (positive time) the cell moved away from the focal adhesion position 
(n= 10; two technical replicates). b, Predicted domain architecture of 

the Agl-Glt machinery based on bioinformatics prediction, sequence 
analysis and previous literature. The different proteins of the complex are 
represented on the basis of their domain structures from bioinformatics 
predictions (Supplementary Table 1). Each protein is represented as a 
single copy in the complex. c, A fixed contractile zone is observed in motile 
sporulating cells where peptidoglycan is profoundly remodelled. Motile 
cells in the early phases of sporulation are shown at 1-min intervals. AglQ- 
sfGFP and GltC-mCherry are specifically enriched at the constriction 

site. Representative wild-type (n= 10), AglQ—sfGFP (n = 2, two technical 
replicates) and GltC-mCherry (n = 4, two technical replicates) cells are 
shown, obtained from two biological replicates. d, Possible mechanism of 
propulsion. The structure of the Agl-Glt machinery is simplified to its core 
components for clarity. The proton flow through a peptidoglycan-bound 
TolQR-type channel (yellow) is proposed to energize cyclic interactions 
between a flexible inner membrane-anchored periplasmic protein (GltG/J, 
black and yellow) and TonB-box proteins in the outer membrane (orange). 
Combined with the rigid anchoring to peptidoglycan and link with MreB, 
this activity would push the outer membrane protein laterally (red arrow) 
because peptidoglycan counteracts the exerted traction force (grey arrow). 
The protein stoichiometries are not known and it is possible that the 
complex contains several coordinated legs, facilitating the processivity and 
directionality of the movements. 


Adhesion and cyclic OM-IM contacts observed at FAS 

A clear prediction of this model is that the motility complex should 
require direct adhesive contacts with the underlying surface to propel 
the cell. Consistent with this prediction, reflection interference contrast 
microscopy (RICM) revealed that motile cells are uniformly in contact 
with the substratum and high RICM densities were correlated with the 
position of AgIZ-YFP clusters (Extended Fig. 5d). Notably, vesicles 
containing only outer membrane materials and specific outer mem- 
brane Glt proteins (GltC-mCherry) were deposited at sites coinciding 
with the positions of focal adhesion complexes in the wake of motile 
cells (Fig. 5a and Extended Data Fig. 5e, f). Overall, these data suggest 
that adhesions involve strong, intimate contacts between the surface 
and the periplasm-outer membrane complex. 
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To gain molecular insight into the mechanisms involved in force 
generation and surface adhesion, we predicted a protein-domain 
structure of the motility complex using bioinformatics approaches 
(Fig. 5b and Extended Data Figs 8, 9). Remarkably, both AglQ and 
AglS contain predicted TolR-like PG-binding motifs’ and AglR, a TolQ 
homologue interacts with GltG? a TolA/TonB-like protein (Extended 
Data Fig. 8a-b). GltG and GltJ are similar modular proteins, specifically 
sharing a single transmembrane helix, a periplasmic helical domain, 
and TonBC motifs'’. Interactions of GltGJ with OM components could 
occur between their TonBC domains and potential TonB-box-carrying 
proteins -GItF in the periplasm (Extended Data Fig. 8a) and GItAB, 
two predicted porin-like 3-barrel proteins in the OM (Extended Data 
Fig. 9a-b). 

Thus, peptidoglycan-anchored Agl motor units may act as stators, 
pushing against adhesive outer membrane complexes through pep- 
tidoglycan to generate propulsive forces. We reasoned that these 
dynamic interactions might be revealed under conditions in which 
the rigidity of the cell envelope is reduced, for example in cells under- 
going sporulation where rapid peptidoglycan remodelling leads to 
the formation of round cells!. Indeed, the induction of sporulation 
rapidly led to the formation of balloon-shaped cells (Fig. 5c). Notably, 
a small fraction of cells (<1%) entering sporulation were still motile 
and formed conspicuous constrictions that, similar to focal adhesion 
complexes, remained fixed relative to the surface (Fig. 5c). These 
constrictions probably resulted from the activity of the Agl-Glt 
machinery because: (i) they formed only in motile sporulating cells 
(<1% of the sporulating cell population); and (ii) both AglQ-sfGFP 
(inner membrane) and GltC-mCherry (outer membrane) were 
enriched at constriction sites (Fig. 5c). Thus, in sporulating cells 
where the structure of peptidoglycan is different (peptidoglycan is 
not detected in mature spores'°), dynamic Agl-Glt-driven physical 
contacts between the outer membrane and the inner membrane are 
unmasked. 


Discussion 

On the basis of our results, we propose that the Agl motor and associ- 
ated inner membrane proteins move directionally by cyclic interactions 
with factors in the periplasmic—outer membrane complex (Fig. 5d). 
Analogous to Tol-Exb systems'*"®, these steps would occur by PMF- 
driven conformational changes in the Glt TolA/TonB-like proteins 
(GltG and/or GltJ; Fig. 5b), the flexible domains of which might extend 
and retract through the peptidoglycan layer (Fig. 5d). Peptidoglycan 
itself could act as both a transient anchor point, as it becomes 
bound by AglQ/S via a TolR-like peptidoglycan-binding motif, and a 
guiding factor, as it opposes contractions to favour lateral movements 
(Fig. 5d). 

The current study does not resolve the relative stoichiometries 
of the Agl and Glt proteins at focal adhesions, but the data show 
that the activity of focal adhesion complexes is subject to regulation 
and contains variable stoichiometries of Ag] motor components and 
connections with the underlying surface. Thus, it is possible that 
several ‘legs’ operate coordinately at these sites. The directionality 
of motility complexes is remarkably consistent between cells, sug- 
gesting that a core cellular structure is involved; peptidoglycan is 
an attractive candidate because the glycan strands are proposed 
to have a global right-handed helical ordering that could serve as 
tracks to guide the motility complex!”. At FASs, the interaction 
with the surface appears to be strong, implying the existence of 
one or more specific adhesion molecule(s); consequently, there 
is likely to be a relief mechanism similar to that found in gliding 
parasites, where the major adhesion molecule is removed from 
the motility complex by specific proteolysis'*. In addition, our 
model makes several important predictions that will help future 
studies, such as the systematic exploration of protein interactions 
in the system to attain a molecular understanding of the motility 
mechanism. 
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METHODS 


Bacterial strains, plasmids, and growth. Strains, primers, and plasmids are listed 
in Supplementary Tables 2 and 3. See Supplementary Tables 2 and 4 for strains 
and their mode of construction. M. xanthus strains were grown at 32°C in casi- 
tone yeast extract (CYE)-rich medium as previously described”!. Plasmids were 
introduced into M. xanthus by electroporation. Mutants and transformants were 
obtained by homologous recombination based on a previously reported method”?. 
All fusions were expressed by gene replacement at the endogenous loci allowing 
expression from the natural promoters. Quantitative PCR experiments confirmed 
that expression was similar to wild-type levels (expression ratios varied between 
1 and 2 compared to wild-type levels). Three types of AglQ fusion were used 
throughout the study: TIRF experiments used AglQ—-mCherry expressed from 
the endogenous locus in place of the wild-type gene. For technical reasons, the 
PCA analysis and expression of AgQ-sfGFP used fusions expressed after ectopic 
integration of the gene of interest at the Mx8-phage attachment site in a AaglQ 
deletion background (Supplementary Table 4). All three strains were indistinguish- 
able in terms of motility and expression patterns. Escherichia coli cells were grown 
under standard laboratory conditions in Luria-Bertani broth supplemented with 
antibiotics, if necessary. 

Motility assays on agar surfaces. For standard phase-contrast and fluorescence 
microscopy, cells from exponentially growing cultures were transferred to a 1.5% 
agar pad with TPM buffer (10 mM Tris-HCl, pH 7.6, 8mM MgSOu,, and 1mM 
KH,PO,) ona glass slide and covered with a coverslip. Imaging was performed 
in a temperature-adjusted microscope chamber at 32°C. To test the function of 
peptidoglycan during motility, sporulation was induced by adding 5% glycerol 
directly into the agar pads as previously described'*. This treatment induces 
rapid and controlled degradation of the Myxococcus peptidoglycan leading to the 
formation of viable spheroplasts!. Cells were imaged 15 min after deposition on 
a glycerol pad in order to image their motility while converting to spheroplasts. 
Microfluidics on chitosan-coated glass slides. M. xanthus cells were immobilized 
ona chitosan-coated surface, as described previously”. In brief, custom-built poly- 
dimethylsiloxane microfluidic glass chambers were coated with chitosan solution 
and washed after 30 min. Chambers were further rinsed with 1 ml TPM buffer 
(10mM Tris-HCl, pH 7.6, 8mM MgSOx,, 1mM KH PO,). Subsequently, 1 ml of 
an exponentially growing culture was injected into the chamber and left for 30 min 
without flow. Unattached cells were removed by rinsing with 1 ml TPM buffer by 
manual injection and time-lapse microscopy on attached cells was performed. 
When needed, A22 (Merck Millipore) was injected manually at indicated concen- 
trations a few minutes after the cells were confirmed to be motile. 

Labelling cells with fluorescent D-amino acids. Lyophilized TADA (molecular 
mass = 381.2 g/mol) powder was re-suspended in DMSO at 150mM and conserved 
at —20°C. The labelling was performed for 2h at 32°C, using 211 TADA solution 
for 1 ml cell culture (OD¢09 = 0.5) in the presence of 1 M NaCl for a duration of 
2h. The sample was then washed four times with 1 ml TPM buffer and finally 
resuspended at OD¢o9 = 5 before being transferred to an agar pad. 
Bioinformatics analyses. Iterative sequence profile searches were performed using 
the PSI-BLAST*? and JACKHMMER™ programs run against the non-redundant 
protein database of National Center for Biotechnology Information (NCBI). 
Similarity-based clustering for both classification and culling of nearly identi- 
cal sequences was performed using the BLASTCLUST program (ftp://ftp.ncbi. 
nlm.nih.gov/blast/documents/blastclust.html). The HHpred program was used 
for profile—profile comparisons”’. Structure similarity searches were performed 
using the DaliLite program”®. Multiple sequence alignments were built by the 
Kalign and PCMA programs?””8, followed by manual adjustments on the basis 
of profile-profile and structural alignments. Secondary structures were predicted 
using the JPred program”. For previously known domains, the Pfam database was 
used as a guide”, and the profiles were augmented by addition of newly detected 
divergent members that were not detected by the Pfam models. Clustering with 
BLASTCLUST followed by multiple sequence alignment and further sequence pro- 
file searches were used to identify other domains that were not present in the Pfam 
database. Signal peptides and transmembrane segments were detected using the 
TMHMM and Phobius programs*"*. Contextual information from prokaryotic 
gene neighbourhoods was retrieved using custom Perl scripts that extract the 
upstream and downstream genes of the query gene, along with their orientation, 
from GenBank files. A combination BLASTCLUST and sequence profile searches 
were then used to cluster the proteins to identify conserved gene-neighbourhoods. 
Structural visualization and manipulations were performed using the PyMol 
(http://www.pymol.org) program. The in-house TASS package, which comprises 
a collection of Perl scripts, was used to automate aspects of large-scale analysis of 
sequences, structures and genome context. 

Epifluorescence microscopy. For regular epifluorescence, GFP or mCherry 
fluorochromes were visualized at 32°C using a temperature-controlled TE2000- 


E-PFS microscope (Nikon) with a 100x NA 1.3 (PhC) objective and a CoolSNAP 
HQ2 camera (Photometrics). All fluorescence images were acquired with 
appropriate filters with a minimal exposure time to minimize bleaching and 
phototoxicity effects. Images were recorded with Metamorph software (Molecular 
Devices) 

TIRFM. TIREM was performed with an inverted microscope (Axio Observer Al, 
Zeiss) equipped with a 100 x Plan-Achromat oil-immersion objective (NA = 1.46) 
mounted on a closed-loop piezoelectric stage (PIFOC, Physik Instrumente). Two 
lasers with excitation wavelengths of 488 nm (OBIS 488LS, Coherent USA) and 
561 nm (Sapphire 561LP, Coherent USA) were used for YFP and mCherry imaging, 
respectively. Laser beams were combined and collimated by a series of dichroic 
mirrors and achromatic lenses, individually controlled by an acousto-optic tunable 
filter (AOTE, AA Opto-electronic) and focused onto the back focal plane of the 
objective through the rear port of the microscope. A translation stage was used to 
shift the positions of the two beams with respect to the objective, enabling an easy 
permutation between Epi and TIRF imaging (Applied Scientific Instrumentation). 
The fluorescence emission signal was collected by the objective lens, separated 
from the excitation wavelengths through a four-band dichroic mirror and filtered 
using bandpass filters inserted in a high-speed motorized filter wheel (Chroma 
Technology). The filtered emission signal was then imaged onto an emCCD 
camera (Andor Ixon 897) through relay lenses allowing an effective pixel size of 
105nm. Along a separate path, a 785-nm infrared laser beam was focused on the 
back focal plane of the objective and reached the glass—sample interface in total 
internal reflection conditions. The reflected infrared beam was imaged by a CMOS 
camera (Thorlabs Inc.) and its position calculated during live acquisition. The 
information was fed-back to a z-positioning piezo stage through PID software 
to correct for any change in the objective-sample distance. This active autofocus 
system locks the focal plane position with a precision of +20nm over hours. All 
acquisition software controlling lasers, filter wheel, translation stages and cameras 
were homemade using LabView 2012 (National Instruments). For time-lapse 
imaging, a high-speed motorized filter wheel was used to sequentially image 
YFP and mCherry channels with a switching time of less than 200 ms. Typically, 
5-10 images were acquired at 20 Hz for each channel and the process was repeated 
45-50 times every 15-30s, depending on the cell speed. For real-time imaging, 
500 images were taken at 20 Hz in the YFP channel. Laser intensity was optimized 
to get the best signal-to-noise ratio (SNR) while limiting photobleaching and 
phototoxicity. In order to compensate for chromatic aberrations, each channel 
was assigned a specific setpoint for the autofocus depending on the position of the 
focal planes for YFP and mCherry fluorescence signals. 

Astigmatism for 3D time-lapse experiments. First, analysis of TADA-bright 
clusters by epifluorescence showed that they are diffraction limited and circular 
in shape, making their 3D tracking by astigmatism possible. 3D imaging was 
performed as previously described*. Briefly, to obtain 3D imaging conditions, a 
corrective MicAO 3D-SR system (Imagine Optic) was inserted into the emission 
pathway between a modified Nikon Eclipse Ti-S inverted microscope and an 
EMCCD camera (Andor Ixon 897). A MicAO 3D-SR adaptive optics device was 
used to correct the microscope point spread function (PSF) for optical aberra- 
tions introduced in the imaging path and to optimize the photon budget. For 3D 
detection, subtle changes in astigmatism were further added in order to break 
the axial symmetry of the PSF and allow an estimation of the axial position of 
diffraction-limited fluorescent objects. Time-lapse imaging of TADA clusters 
was performed after transferring to an agar pad freshly labelled cells mixed with 
100-nm fluorescent beads (TetraSpeck, Thermo Scientific), used as fiducial 
and calibration markers. Every 15-30, two series of 5-10 successive images 
were simultaneously acquired at 20 Hz: one under continuous epifluorescence 
illumination with a 561-nm read-out laser to detect the fluorescence of the TADA 
clusters (Sapphire 561LP, 150 mW, Coherent) and the second using a brightfield 
illumination to get an image of the cells. Then, a calibration was performed on the 
same field of view by imaging single fluorescent beads while scanning the sample 
along the optical axis (z) by steps of 50 nm. 

Detection and quantification of TADA clusters is described below. In total, data 
from eight cells were successfully analysed (five biological replicates). However, 
the behaviour described was clearly observed on numerous occurrences (n= 10), 
though the data were not of high enough quality (low SNR, cell to cell contact, 
cell moving too quickly or out of the observation field and so on) to obtain long 
trajectories. 

RICM. For label-free imaging of the underside of cells in contact with the 
substratum, a modified form of reflection imaging for bacteria was employed". 
RICM was performed on chitosan-coated microfluidic chambers. Channels 
were seeded with ApilA cells expressing AgIZ-YFP resuspended in TPM buffer 
with 1 mM CaCl, (OD¢09 = 0.5) for 10 min, then washed with the same buffer. 
Images were obtained at 32°C on a Zeiss Axiovert 200 inverted microscope 
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with adjustable aperture and field stops. For imaging, an RICM objective (Zeiss 
Neofluar 63/1.25 antiflex) and a differential interference contrast objective (Plan- 
Apochromat 63 x 1.40 oil) were used for crossed-polarized light and fluorescence 
images, respectively. For RICM, cells were illuminated through a 546 + 12 nm 
narrow bandpass filter with a mercury lamp (X-cite 120Q lamp) for 20 ms. For 
fluorescence, samples were excited with a laser at 488 nm for 1,000 ms. Images 
were captured at 10-s intervals and combined in ImageJ. 

Fluorescence recovery after photobleaching (FRAP). FRAP was performed 
with a 488-nm laser mounted directly on the TE2000-E-PFS microscope (Nikon), 
allowing us to focus a micrometre-radius laser beam with micrometre precision. 
FRAP acquisitions were performed with a home-developed macro under 
Metamorph. 

Cluster image analysis. Trajectories of fluorescent clusters were obtained from 
the coordinates calculated in a cylindrical reference system based on the shape of 
the cell. This step was performed automatically and verified manually using the 
Microbe) plugin (http://www.indiana.edu/~microbej/) in FIJI/Image]*°. These 
coordinates were used to calculate angle, speeds, mean square displacements 
and angles using R software. To calculate the slope of helicoidal trajectories, the 
unwrapped coordinates were fitted with a linear regression model and y angles 
were determined as the arctangent of the line coefficient (Extended Data Fig. 2e). 
In TIRFM, owing to the narrow depth of focus, different fluorescent objects 
(AglZ-YFP) were treated as if they were in the same plane. This simplified the 
method of calculating the trajectory angles. A z-stack projection of maxima was 
applied (FIJI). Some cells in the projection image showed clear linear alignments 
of clusters crossing the cell body. The angle between the linear clusters trajectory 
and the major axis of the cell was manually measured with the angle tool of the 
FIJI software. 

Principal component analysis. The number of clusters and the relative positions 
of AglZ-YFP, GItI-YFP and AglQ-mCherry in agl;git mutant backgrounds were 
obtained by combining the mask of cells obtained from phase contrast images 
and the fast Fourier transform (FFT)-filtered fluorescent image (FIJI, Extended 
Data Fig. 3b). In brief, the phase-contrast image of rod-shaped cells provides a 
mask of cell bodies and yields morphological parameters and definition of the 
longitudinal axis. Following a straightening operation, this axis can be used as a 
reference frame for cluster localization. Because the internal clusters are of weak 
intensity, the images were denoised by applying a neutral density filter and back- 
ground subtraction. The presence of fluorescent clusters was systematically verified 
on unprocessed images to ensure that the procedures did not generate artefactual 
signals. The following list provides the number of cells analysed for each condition 
resulting from six technical replicates. For the AgIZ-YFP reporter: wild-type, 67; 
AgltA, 146; AgltB, 407; AgltC, 224; AgitD, 139; AgitE, 168; AgitF, 136; AgitG, 
323; AglitH, 114; AgltI, 193; Agit], 174; AgltK, 194; AgitQ, 407. For the GltI-YFP 
reporter: wild-type, 187; AgitA, 688; AgltB, 292; AgltC, 321; AgitD, 133; AgltE, 
296; AgltF, 114; AgltG, 64; AgltH, 261; Aglt], 359; AgitK, 694; AgitQ, 187. For the 
AglQ-mcherry reporter: wild-type, 163; AgitA, 124; AgitB, 723; AgitC, 266; AgitD, 
201; AgltE, 399; AgitF, 196; AgltG, 189; Aglt}, 511; AgltK, 197. 

Two-colour TIRFM. Image analysis was performed using Matlab 2015 
(MathWorks). For each experiment, only cells displaying gliding displacement 
were analysed. For each selected bacterium, a temporal RGB image was calculated 
using mCherry fluorescence images and the cell trajectory path was manually 
drawn (Extended Data Fig. 6a). A montage and its associated kymograph were 
then calculated for both YFP and mCherry channels by straightening and re-slicing 
each time-lapse image along that path (Extended Data Fig. 6b). For the kymograph, 
the best contrast was obtained by averaging the intensity of the three brightest 
pixels (over a total of 13 pixels) for each slice. To further improve the contrast of 
the kymographs and highlight the presence of fluorescent clusters, a denoising 
algorithm (modified from ref. 36) was applied (Extended Data Fig. 6c). 
Photobleaching was also quantified for each channel based on the variation of 
non-specific fluorescence signal measured in the cells over time. A single expo- 
nential model was used to fit the variation of intensity and correct the kymograph 
intensity accordingly. Small shifts induced by chromatic aberrations between the 
two channels were corrected as well by realigning the two kymographs using an 
image cross-correlation algorithm (precision of 1 pixel, 105 nm). Finally, for the 
co-localization calculation, the intensities of both images were standardized in 
order to compare the fluorescent signals from both YFP and mCherry channels. 
Cell tracking in kymographs. To calculate the cell displacement over time, the 
kymograph displaying the best contrast was first interpolated: the number of pixels 
was increased by twofold in each dimension and the intensity of each pixel recalcu- 
lated using a cubic interpolation algorithm. An iterative edge-detection algorithm 
based on the Canny method was then applied to the interpolated image*”. Typical 
values for the standard deviation were between 3 and 5 pixels. By iteratively 
changing the Canny detection threshold, the algorithm converged towards two 
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different edges highlighting the movement of both cell poles during the time- 
lapse. Errors in the edge calculation were sometimes observed when the lagging 
pole of the cell was not properly attached to the chitosan-treated surface and/or 
when other bacteria were in contact with the selected cell. In that case, a manual 
correction was performed in order to remove inconsistent points and rectify the 
position of the edges. 

Often, the two edges were not exactly identical owing to imprecision in the 

localization of poles. Indeed, we could observe that the leading pole of the cell 
was usually very well defined (highest fluorescence intensity) resulting in a very 
precise calculation of its localization over time. On the contrary, the lagging 
pole was often poorly defined due to lower fluorescence intensity and/or weaker 
adhesion to the chitosan-treated surface. Therefore, in order to quantify the cell 
displacement during the time-lapse, the edge defined with the highest preci- 
sion was selected. The speed was also estimated from the first derivative of the 
displacement and analysed in order to automatically identify when the cell was 
mobile (gliding) during the time-lapse acquisition. To do so, for each experiment, a 
velocity threshold was empirically estimated by analysing the movement of a small 
subset of cells. Then, for each time-point, bacteria were classified as either mobile 
or immobile depending on whether their velocity surpassed the velocity threshold. 
Typical values for the threshold lay between 0.05 and 0.21m/min, depending on 
experimental drift and noise (Extended Data Fig. 6d). In average, cell speed on the 
chitosan-coated surface was 0.5 + 0.5 jzm/min. 
Co-localization. Co-localization between YFP-tagged and mCherry-tagged 
proteins was calculated in three steps (Extended Data Fig. 7a). First, the two 
montages were separately analysed in order to localize the fluorescent clusters in 
each channel. For each montage, its normalized cross-correlation with a Gaussian 
spot (s.d. of 1 pixel) was computed. Fluorescent aggregates were then delimited by 
thresholding the image (typical value 0.55-0.75) and grouping the selected pixels 
in separate clusters based on their connectivity. Clusters composed of fewer than 
4 pixels were systematically discarded. 

Ina single cell, many different fluorescent clusters C,, = ,2,.. could be simulta- 
neously detected, each with a different trajectory. As the same fluorescent cluster 
C,, could be detected in several successive images, localizations were grouped by 
manually stitching together its detections (C,", C,*1, ... Cal'**) on the montage. 
Therefore, each cluster C,,= 1,2, was now defined by all its localizations during 
the time-lapse acquisition. By analysing their positions over time, each observa- 
tion of the same cluster was labelled as either stationary or dynamic. Finally, it 
is important to point out that clusters detected at the poles were not taken into 
account for the analysis. The pixel size in our experiments was 105 nm; thus, the 
precision of detection of the maximum of intensity of clusters from kymographs 
was approximately ~100 nm. 

Next, for each montage, the localizations of the clusters were projected onto the 
associated normalized kymograph. A protein detection map was then calculated 
by keeping only the intensities of the pixels that were part of a cluster (Extended 
Data Fig. 7b). The intensity of all other pixels was set to zero. By superimposing 
the trajectories of the two cell poles, it was then possible to study how the detection 
of protein clusters was correlated with cell displacement. Each row of the map 
corresponded to a snapshot of the protein localization and intensity within the cell 
at a given time of the experiment. Finally, the co-localization map was calculated 
by multiplying the two detection maps together (Extended Data Fig. 7c). From 
the distinction performed earlier between stationary and dynamic clusters, we 
could measure how the co-localization intensity varied over time for both types 
of clusters. Two curves were therefore calculated by summing along each row the 
intensity of the pixels associated to either dynamic or stationary clusters. In the end, 
the total co-localization signal was obtained by summing together the two curves. 
Correlation between cell movement and co-localization. For each cell, correlation 
between movement and protein co-localization was calculated, taking static and 
dynamic clusters into account separately (Extended Data Fig. 7d, e). For each time- 
point, the correlation was set to 1 when the cell was mobile and co-localization was 
detected or when the cell was immobile and no co-localization was measured. In 
all the other cases, the correlation was set to —1 (Extended Data Fig. 7f). Then, a 
correlation score was attributed to each cell and each type of cluster (stationary or 
dynamic) by calculating the mean of all the values over time. 

To further estimate the connection between cell movement and protein co- 
localization, a more quantitative analysis was performed. For each cell, speeds 
were first renormalized between 0 (no movement) and 1 (maximum velocity) 
and segmented into 15 equally spaced bins. Next, for each bin, we selected all the 
occurrences associated to a moving cell and verified whether static or dynamic 
co-localized clusters were detected. By repeating this analysis on all cells, we could 
therefore estimate for each bin the percentage of events where co-localization was 
detected. Finally, the percentage of co-localization was plotted separately for static 
and dynamic clusters as a function of the normalized cell speed. 
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Correlation between cell movement and cluster fluorescence intensity. To inves- 
tigate the connection between cell movement and protein recruitment at focal 
adhesion complexes, we analysed how the intensity of AgIZ and AglQ clusters 
changed with cell speed. Owing to cell-to-cell variability in the measured fluores- 
cence signal, the intensity of static clusters was renormalized for each cell. For the 
cells displaying at least one dynamic cluster, the maximum intensity of dynamic 
clusters was calculated and used as an internal reference to normalize the intensity 
of all detected clusters within the cell. Behind this normalization procedure, we 
make the assumption that dynamic clusters have on average the same stoichiom- 
etry or composition from cell to cell. This assumption was verified since, after 
analysing more than 100 cells for AgIZ and AglQ proteins, the intensity measured 
after normalization for the dynamic clusters showed a very small dispersion (mean 
intensity of 1 + 0.15 for AgIZ and AglQ) while the dispersion for the static clusters 
was about three times higher (mean intensity of 1.38 + 0.5 for AglZ and 1.4+ 0.43 
for AglQ). Then, as already described for the correlation between cell movement 
and co-localization, we plotted the intensity of both static and dynamic clusters as 
a function of the normalized cell speed. 

Note that this analysis could only be performed for AglZ and AglQ, as in this 

case the proportion of cells displaying both dynamic and static clusters was large 
enough to obtain a statistically relevant sample size (59% for AglZ and 64% for 
AglQ). For GltC and GltD, however, very few dynamic clusters were detected and 
these proportions dropped to 5% and 8%, respectively, which was insufficient for 
performing a similar analysis. 
Analysis of TADA clusters by astigmatism. Detection of beads from the 
calibration were analysed using RapidSTORM™. For each axial position (z) of the 
sample, the PSF of a single fluorescent bead was fitted with an elliptical Gaussian 
function and the x-y widths (w, and wy) were calculated in order to produce the 
calibration curves w,(z) and w,(z). 

Only gliding cells displaying a bright and well-defined TADA cluster were 

selected for the analysis. First, the calibration curves were used to infer the 
axial positions of the clusters using RapidSTORM. Then, the cell outline was 
calculated on each image using the bright-field images. The cell outline was used 
to reconstruct the complete trajectory of the cell during the time-lapse and infer the 
lateral position of the TADA cluster. From the trajectory, we precisely determined 
for each image the total distance travelled by the cell since the beginning of the 
acquisition. Finally, the 3D position of the TADA clusters was plotted as a function 
of the distance travelled by the cell. 
Principal component analysis and statistical tests. The PCA analysis was 
performed using Matlab 2014 and the Statistics and Machine Learning Toolbox. 
We started by defining a data matrix C; for each mutant (WT, dA, dB, dC, ...) and 
reporter (AgIZ, AglQ and GItI). For example, in the case of AglZ as a reporter, we 
defined one array Cwr for the wild-type strain and twelve others (Cga, Cup, Cac...) 
for each mutant. Each row of an array C; corresponded to a single observation of a 
cluster measured by fluorescence microscopy. The columns corresponded to the 
variables used to describe the properties of the clusters and of the associated strain: 
(i) longitudinal position of the cluster (0 at the leading pole, 1 at the lagging pole); 
(ii) lateral position (0 at the centre, 1 on the edge); (iii) total number of clusters 
detected simultaneously in the same cell; and (iv) proportion of cells containing 
at least one cluster for the selected strain and reporter. 

For each reporter, the PCA calculation was performed by combining all the 
data collected (wild-type and mutated strains). Therefore, a single set of principal 
components were defined and subsequently used to illustrate the differences 
between the wild-type and the mutated strains. The analysis was performed in 
three steps. First, all the data-matrices C; were concatenated in a single matrix Cy 
and the values of each column were centred (mean was equal to 0) and standardized 
using the inverse variance. Then, the PCA analysis was performed and returned 
the coefficients of the principal components (PCs, Extended Data Fig. 4a—c) as 
well as the amount of variance accounted for by each of them (Extended Data 
Fig. 4d). Independently of the nature of the reporter, we observed that the three 
first components accounted for more than 87% of the total variance (>68% for 
the two first components). Therefore, the clusterization calculation was performed 
later using only the three first PCs. 


Ina second step, the values of the PCs were used to calculate the representations 

C?° of each data-matrix C; in the principal component space. Using a system of 
axes defined by the three first PCs, we could represent each array C’° by a scatter 
plot in which each point represents the properties and localization of one single 
cluster in the PC space (Extended Data Fig. 4e). To improve the readability of the 
plots, we represented only the median position of each scatter plot (Extended Data 
Fig. 4e). We also used the standard deviation to illustrate the dispersion of the data 
along each PC. Finally, in order to make the comparison between wild-type and 
mutated strains easier, we arbitrarily placed the wild type at the origin. 
Statistics and replicates. By default we used the Wilcoxon (two-sided) test to 
significantly separate the different samples. For each experiment, the number of 
times it was independently replicated in the laboratory (biological replicate) is 
indicated either in the figure legend or in the corresponding Methods section. All 
errors calculated on values were determined using the deviation standard formula. 
Data and code availability. Code used for two-colour TIRF and astigmatism 
experiments was written using Matlab 2015. Scripts are available upon request. The 
data that support the findings of this study are available from the corresponding 
author upon request. 
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Extended Data Figure 1 | Dynamic AglZ-YFP clusters in non-motile 
cells. a, Dynamic Ag]Z-YFP clusters in a non-motile cell observed by 
TIREM. Kymograph representation of cluster movement captured every 
0.5s. Note that the clusters form at the cell pole and move directionally 
towards the opposite cell pole where they are dispersed. Scale bar, 21m. 
b, Distribution of cluster speeds between and within cells. Note that 
clusters can move at different speeds in a cell and that the speed between 


0 5 10 20 50 


10 20 50 A22 (ug/ml) 
A22 (ug/ml) 


cells generally varies between 2 and 4\1m min (two technical replicates). 
c, Number of AglZ-YFP clusters per cell per minute in wild-type and A22- 
treated cells (two technical replicates). d, AgIZ-YFP cluster speed in wild- 
type and A22-treated cells (two technical replicates). e, Trajectory angles 
in wild-type and A22-treated cells (two technical replicates). Wilcoxon 
tests; *P< 0.1; **P< 0.01; ***P< 0.001 (c-e). 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Myxococcus cells rotate along their long 
axes during motility. a, TADA-bright clusters form in Myxococcus 

cells subjected to a brief osmotic shock. TADA is incorporated in the 
Myxococcus cell envelope only when the cells are subjected to NaCl 
treatment (see Methods). b, c, TADA-bright clusters are not dynamic in 
non-motile cells. TADA-bright cluster movements are not detectable in 
non-motile cells (c, red dots, n = 14; two technical replicates) and only 
detectable in moving cells (c, blue dots, n = 17; two technical replicates). 
MSD, mean square displacement. Scale bar, 2 1m. d, TADA-bright clusters 
are inert. FRAP analysis reveals the absence of fluorescent molecule 
exchange in TADA-bright clusters (n = 6; one technical experiment). 
Scale bar, 2jm. e, TADA-bright cluster rotation reflects rotation of 

the cell during movement. A cell on which four TADA-bright clusters 
were tracked is shown. The radial velocity of each cluster calculated by 


ARTICLE 


projection of the 2D images on the model 3D cell cylinder (left, 3 angle) 

is plotted against the linear displacement of the cell. Each TADA cluster 
moved at the same radial speed and proportionally to the speed of the cell, 
indicating that TADA clusters are inert objects reporting on the rigid-body 
movement of the cell. f, Three dimensional trajectories of TADA-bright 
clusters reconstructed by astigmatism. In the absence of astigmatism, the 
size of TADA clusters corresponds to the diffraction limit of light, and 
they are circular (that is, the size of the PSF is the same in perpendicular 
directions), making the astigmatic analysis of axial position possible. 

g, TADA-bright clusters rotate clockwise. Cluster intensity fluctuations 
and positions relative to the cell axis are shown over time (left) ina 
representative cell (right). The black arrow points to the analysed cluster. 
The representative cell shown in the right panel was isolated from others 
in the field with a black mask (n = 10; three technical replicates). 
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Extended Data Figure 3 | Analysis of AgIZ-YFP, GltI-YFP and 
AglQ-mCherry in agl and git mutant backgrounds. a, Each fluorescent 
functional fusion gene was introduced in place of the wild-type gene 

in each genetic background shown. Typical examples are shown for 

each strain. Crossed boxes indicated genetic backgrounds that were not 
obtained for this study. Scale bar, 2 1m. b, Cluster detection and analysis 
chart. Phase contrast and fluorescence images were processed so as to 
respectively extract cell masks of isolated cells (compared edited mask 
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FFT band pass Filter 
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% MicrobeJ 


Raw data worksheet : 
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to raw mask) and the position of fluorescence clusters following the 
application of a fluorescence bandpass filter. Note that the intensity of the 
fluorescence clusters was not exploited owing to lack of robustness and 
day-to-day fluctuations. The cluster coordinates were then defined relative 
to cell coordinates with the Microbe J plugin (http://www.indiana.edu/ 
~microbej/) in Fiji, compiled in R sheets and further analysed by PCA 
using custom-written code in Matlab. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


a 
IZ PC1 PC2 PC3 PC4 
Longitudinal Position 0.5677 0.2464 -0.1929 0.7614 
Axial Position 0.5054 -0.0425 0.8491 -0.1479 
Number of cluster 0.3732 -0.8831 -0.2773 -0.0627 
Proporstion of cells 0.5320 0.3970 -0.40€ -0.6280 
b 
Gitl PC1 PC2 PC3 PC4 
Longitudinal Position 0.5953 0.0891 -0.0150 0.7984 
Axial Position 0.4817 -0.1351 0.8009 -0.3290 
Number of cluster 0.4377 -0.7101 -0.4883 -0.2563 
Proporstion of cells 0.4712 0.6852 -0.3462 -0.4342 
c 
AgilQ PC1 PC2 PC3 PC4 
Longitudinal Position 0.6142 -0.0131 0.3934 0.6840 
Axial Position 0.5559 0.1406 -0.8189 -0.0255 
Number of cluster 0.2323 0.8417 0.3138 -0.3729 
Proporstion of cells 0.5098 -0.5211 0.2760 -0.6264 
d Gitl 
100 100 
= = 88% HH 
= < 80 SS 80 
3 3 LE - 68% - 8 
is & 60 = 60 
wi 
® 5 40 5 40 
1S) 
< 2 2 
£ @ & 
a s 20 £ 20 
rea 0 a 0 oat 
Principal Component Principal Component Principal Component 
e 


@eAA @ AD @AG e@AJ 
eAB © AE @AH e@AK 
@eAC e@AF e@Al ®AQ 


e WT 


Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | PCA of AglZ-YFP, GItI-YFP and AglQ- 
mCherry in agl and git mutant backgrounds. a-c, Coefficients of the 
principal components (PCs) for AgIZ-YFP, GItI-YFP and AglQ-mCherry. 
PCs are the eigenvectors of the correlation matrix calculated from the 

four parameters indicated in the first column of the tables. Together, PCs 
form an orthogonal basis in which the vectors are uncorrelated. PCs are 
sorted according to the amount of variability in the data they describe, 
PC1 having the largest effect (that is, variance) and PC4 the least. d, Scree 
plots displaying the variance associated to each PC. The bar plot represents 


the variance associated with each PC for a given fusion (from left to right: 
AglZ-YFP, GltI-YFP and AglQ-mCherry). The cumulative variance is 
also plotted (light blue line). Note that PC1-2 describe on average 70% 

of the total variance and PC1-3 more than 87%. e, Projection of the 

data in the space defined by the three first PCs. For each mutant, data 

are represented by a scatter plot of a specific colour (see inset for colour 
code). For each direction and each mutant, the average and s.d. of the data 
are symbolized by a single bold line: the centre of the line represents the 
average and its length the s.d. 
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Extended Data Figure 5 | Motility is propelled by cyclic interactions 
between the inner membrane-localized motor and outer membrane- 
localized adhesins of the motility complex. a, Epifluorescence analysis 
of representative cells expressing AgIZ-Y FP-AglQ/GIltD/GltC-mCherry. 
In each case, fluorescent functional fusions are expressed in place of the 
wild-type gene. Note that while AgIZ-YFP and AglQ-mCherry clusters 
can be detected, GltC-mCherry and GltD-mCherry appear mostly diffuse 
around the cell envelope with these imaging conditions. Forty cells were 
imaged for AgIZ~YFP-AglQ with two biological replicates; 232 cells 

were imaged for AgIZ-YFP-GltD with three biological replicates; and 55 
cells were imaged for AgIZ-YFP-GltC with three biological replicates. 
Scale bar, 21m. b, Protein co-localization in static clusters. For each cell 
analysed, a percentage of co-localization is computed for proteins detected 
only in static clusters. Values range from 0 when no co-localization between 
the two proteins was detected in the cell to 100% when the two proteins 
were always detected together. The left panel shows the percentage of 
co-localization for AglQ, GItD or GltI with respect to AglZ. Single data 
are represented by a scatter plot (0), the median co-localization value is 
symbolized by a blue line and the s.d. by light grey boxes. On average, 96% 
of AglQ clusters colocalized with AglZ (n = 153 clusters; two biological 
replicates), 96% of GltD (n = 120; seven biological replicates) and 83% 

for GltC (n = 100; three biological replicates). Inversely, the right panel 
represents the percentage of co-localization of AglZ with respect to AglQ 
(97%, n= 152; two biological replicates), GltD (91%, n = 120; seven 
biological replicates) and GltC (69%, n = 100; three biological replicates), 
respectively. c, Protein co-localization in mobile clusters. Box-plots read 
as in b and describe the percentage of co-localization for proteins detected 
only in dynamic clusters. The left panel illustrates the co-localization of 
AglQ, GltD or GltC with respect to AglZ (76%, n = 106, two biological 
replicates; 39%, n = 11, seven biological replicates; and 0%, n= 4, three 
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biological replicates, respectively). The right panel shows the percentage of 
co-localization of AglZ with respect to AglQ (41%, n= 125; two biological 
replicates), GltD (0%, n = 72; seven biological replicates) and GltC (0%, 
n=A1; three biological replicates). From the two panels, it appears that 
co-localization in dynamic clusters is essentially observed between AglZ 
and AglQ. For GItD and GltC, dynamic clusters are almost never detected 
(n= 11 for GltD, n= 4 for GltC) leading to a co-localization score close 

or equal to 0%. d, AglZ-YFP clusters localize within adhesive contact 
zones. Left, RICM of a representative gliding cell (n= 10, two biological 
replicates, 30-s time frames; scale bar, 2 um) showing intimate connection 
with the chitosan-coated glass surface (dark zone). Right, adhesions and 
AglZ-YFP cluster localization in detaching cells by RICM and combined 
epifluorescence microscopy (time frames, 30 s; scale bar, 21m). The 

graph represents the distribution of RICM intensities at AgIZ-YFP cluster 
positions (red line) compared to the average intensity along the whole cell 
body (black line). Data obtained for n = 20 cells; two biological replicates. 
e, Gliding Myxococcus cells deposit outer membrane vesicles in their wake. 
TIREM images of a motile cell expressing both outer membrane sfGFP 
and inner membrane mCherry. Outer membrane vesicles are deposited, 
suggesting that the cell is firmly adhered to the underlying surface. Shown 
is a representative cell (mn = 60; 12 technical replicates). f, GltD-mCherry, 
a periplasmic Glt protein, is not released by gliding cells at focal adhesion 
complexes. Top, TIRFM snapshots of a representative cell expressing both 
GltD-mCherry (red) and AglZ-YFP (green). The position of the GltD 
clusters on the surface coincides with position of focal adhesion complexes 
(white asterisk). Time frames, 15s; scale bar, 2\1m. Bottom, variation of 
intensity for GltD-mcherry (red) and AglZ-YFP (green) as a function of 
time before (negative time) and after (positive time) the cell moved away 
from the focal adhesion position, shown for n= 10 cells (two biological 
replicates). 
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Extended Data Figure 6 | Image analysis for TIRFM experiments. 

a, Temporal RGB image computed from fluorescence images (mCherry 
for this example). Images were summed together and colour coded from 
blue (first image) to red (last image). Immobile cells appear uniformly 
white whereas moving cells show coloured extremities. Cell trajectory is 
represented by a yellow dotted line. b, For the two imaging channels (YFP 
and mCherry), a kymograph and a montage were calculated. Kkymograph 
reads from top (first image) to bottom (last image), each line representing 
the average fluorescent intensity computed along the cell trajectory. The 
montage shows for each acquisition an image of the cell after applying a 


straightening algorithm. Clusters of proteins (AgIZ-YFP or AglQ/ 
GltD/GltI-mCherry) appear as bright spots at the centre of the cell. 

c, Kymographs after applying a denoising algorithm. The cell outline 

was depicted either by a dotted white line when the cell was immobile 

or by a continuous white line when the cell was gliding on the surface. 

d, Cell speed as a function of acquisition time. For each cell, a threshold 
was defined that depended on the SNR and the sample lateral stability 
during TIRFM acquisition. When the cell speed was below this threshold 
(horizontal dotted line), the cell was considered immobile. Phases 
associated with cell movement are highlighted in blue. 
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Extended Data Figure 7 | Co-localization estimation and correlation 
with cell movement. a, Co-localization between YFP-tagged and 
mCherry-tagged proteins was calculated in three steps. For each channel 
(YFP and mCherry), a threshold was applied to the montage in order 

to detect protein clusters (clusterization). When the same cluster was 
observed in successive images, its localizations were stitched together 
manually and finally classified as either static (red) or dynamic/mobile 
(blue). b, For each channel, a protein-detection map was computed 

from the kymograph and the positions of the detected clusters. The cell 
outline was depicted by two white lines. c, The co-localization map was 
obtained by multiplying the two protein-detection maps. In the inset, the 
cumulative intensities associated with static (red) and dynamic (blue) 
clusters are plotted as a function of time. d, Cell speed as a function of 
acquisition time. Blue boxes represent regions in which cell speed is higher 
than the threshold. e, Cumulative co-localization intensity (static and 
dynamic) as a function of acquisition time. Blue boxes represent regions 
with high cell speeds. f, Curves from d and e were binarized. For the cell 
speed, the value was set to ‘0’ when the speed was below the threshold 
(immobile cell) and to ‘1’ when it was above (gliding cell) (left). For the 
co-localization, the value was set to ‘1’ when co-localization was detected, 
‘0’ otherwise (left). A correlation curve (right) was then computed by 
comparing the two curves. At each time-point, if the values of the two 
binarized curves were equal (1/1 or 0/0, green highlighted areas), the 
correlation was set to ‘1. Otherwise, it was set to “—1’ (red highlighted 
area). Finally, the correlation score was defined as the average of all the 
correlation values. 
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STAUR_RS13230_Stigmatella_aurantiaca_488688940 
STAUR_RS27825_Stigmatella_aurantiaca_488691341 
COCOR_RS16180_Corallococcus_coralloides_504208954 
COCOR_RS09690_Corallococcus_coralloides_504207687 
COCOR_RS12765_Corallococcus_coralloides 504208286 
BDW_RS02800_Bdellovibrio_bacteriovorus_644384878 
EPO1_RS18390_Bdellovibrio_bacteriovorus_915421802 
AA314_03201_Archangium_gephyra_827421679 
AA314_RS19755_Archangium_gephyra_836724663 
D187_RS37700_Cystobacter_fuscus_488703505 
SCE_RS14980_Sorangium_cellulosum_501192530 
SCE1572_RS18295_Sorangium_cellulosum_769241756 
SCE1572_RS03830_Sorangium_cellulosum 522819235 
SCE_RS02950_Sorangium-cellulosum_909620523 
CAP_RS36675_Chondromyces_apiculatus_918245634 
CAP_RS07325_Chondromyces_apiculatus_763380561 
STAUR_RS09640_Stigmatella_aurantiaca 488689222 
MXAN_%331_Myxococcus_xanthus_DK_1622_108466916 
COCOR_01216 Corallococcus_coralloides _DSM_2259_380727937 
BMS_RS01320_Halobacteriovorax_marinus_504008990 
BMS_RS06605_Halobacteriovorax_marinus_504010038 
M899_RS16405_Bacteriovorax_sp_BSW11_1V_ 544858171 
M899_RS08190_Bacteriovorax_sp_BSW11_1V_544856523 
BD_RS06700_Bdellovibrio_bacteriovorus_ 499477323 
BDW_RS04945_Bdellovibrio_bacteriovorus_644387158 
BDT_RS11665_Bdellovibrio_bacteriovorus 499478154 
AA314_RS08895_Archangium_gephyra_836723026 
D187_RS37965,Cysiobacter Tuscus, 488703448 
EP01_RS03085_Bdellovibrio_bacteriovorus_740663405 
consensus/80% 
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CI CsoTonB Box Analog 


Extended Data Figure 8 | Bioinformatics analysis of AglQ/S and GItF. labelled using gene name, organism name and GI. The potential TonB 
a, AglQ and AglS carry a potential peptidoglycan (PG)-binding site. 
Multiple alignment of AglQ, AglS and their paralogues with TolR(Sby4A) an extended region that forms a }-strand-like structure that is not paired 
and ExbD (2pfuA). The gene name, organism name and GI (NCBI) 
accession number are given. The structure is shown on top and the 
80% consensus is shown below the alignment. b, Cartoon views of the sequence profile analysis shows that GItF is related to the N-terminal 
structures are shown below with the residues known or predicted to bind region of certain OMP barrels that do contain a potential TonB-box like 
peptidoglycan shown as sticks. c, The GItF family of proteins found in peptide. 

deltaproteobacteria. Multiple alignment of the GItF family is shown and 


box analogue is indicated. The TonB box can be most generally defined as 


with other 6-strands into a structural unit. The TonB box typically has 
two polar residues, T/S, and classically an acidic/amide residue. The GItF 
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Structure 
NMBH4476_RS08510[1p4tA]_Neisseria_meningitidis_ 488154303 
MXAN_4866[GItH]_Myxococous_xanthus_DK_1622_ 108461169 
MXAN_2540[GItA|_Myxococcus_xanthus_DK_1622~ 108763772 
MXAN_2539[GItB]_Myxococcus_xanthus_DK_ 1622108762511 
MXAN_3378_Myxococcus_xanthus_DK_1622_ 108761772 
MXAN_3372_Myxococcus_xanthus_DK_1622_ 108759830 
MXAN_3371_Myxococcus_xanthus_DK_1622_ 108762939 
MXAN_1329" Myxococcus_xanthus_DK_1622_108759570 
consensus/90% 
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Extended Data Figure 9 | Bioinformatics analysis of the GltA/B/H system. a, Multiple alignment of the 6-barrel OMP proteins GItA, GItB and GItH 
and their paralogues with NspA (1p4tA). Note the presence of TonB box analogues in GItA and GItB but not GItH. b, Cartoon view of the inferred trimer 
of 8-barrels based on the NspA structure with the residues predicted to form a trimeric interaction interface shown as spheres. 
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Doped polymer semiconductors with ultrahigh and 
ultralow work functions for ohmic contacts 


Cindy G. Tang’?*, Mervin C. Y. Ang***, Kim-Kian Choo**, Venu Keerthi*, Jun-Kai Tan’, Mazlan Nur Syafiqah’, 
Thomas Kugler‘, Jeremy H. Burroughes*, Rui-QiPng!’, Lay-Lay Chua!?? & Peter K. H. Ho! 


To make high-performance semiconductor devices, a good ohmic 
contact between the electrode and the semiconductor layer is 
required to inject the maximum current density across the contact. 
Achieving ohmic contacts requires electrodes with high and low 
work functions to inject holes and electrons respectively, where 
the work function is the minimum energy required to remove 
an electron from the Fermi level of the electrode to the vacuum 
level. However, it is challenging to produce electrically conducting 
films with sufficiently high or low work functions, especially for 
solution-processed semiconductor devices. Hole-doped polymer 
organic semiconductors are available in a limited work-function 
range)”, but hole-doped materials with ultrahigh work functions 
and, especially, electron-doped materials with low to ultralow work 
functions are not yet available. The key challenges are stabilizing the 
thin films against de-doping and suppressing dopant migration**. 
Here we report a general strategy to overcome these limitations 
and achieve solution-processed doped films over a wide range 
of work functions (3.0-5.8 electronvolts), by charge-doping of 
conjugated polyelectrolytes*’ and then internal ion-exchange to 
give self-compensated heavily doped polymers. Mobile carriers 
on the polymer backbone in these materials are compensated by 
covalently bonded counter-ions. Although our self-compensated 
doped polymers superficially resemble self-doped polymers®’, they 
are generated by separate charge-carrier doping and compensation 
steps, which enables the use of strong dopants to access extreme 
work functions. We demonstrate solution-processed ohmic contacts 
for high-performance organic light-emitting diodes, solar cells, 
photodiodes and transistors, including ohmic injection of both 
carrier types into polyfluorene—the benchmark wide-bandgap 
blue-light-emitting polymer organic semiconductor. We also show 
that metal electrodes can be transformed into highly efficient 
hole- and electron-injection contacts via the self-assembly of these 
doped polyelectrolytes. This consequently allows ambipolar field- 
effect transistors to be transformed into high-performance p- and 
n-channel transistors. Our strategy provides a method for producing 
ohmic contacts not only for organic semiconductors, but potentially 
for other advanced semiconductors as well, including perovskites, 
quantum dots, nanotubes and two-dimensional materials. 

The usual way of making an ohmic contact involves using suitable 
metals or metal oxides as the electrode, which requires vacuum depo- 
sition and/or a high-temperature post-anneal step. Although charge- 
doped conducting polymers are promising alternatives, the availability 
of p- and n-doped polymer organic semiconductors is limited. Electro- 
oxidation and electro-reduction can generate a range of doped polymers, 
but are not generally practical for device applications. Surface dipolar 
layers, such as perfluorinated ionomers"’ and polyethylenimine"', 
have been reported to extend the work-function range of metal 
oxides, poly(3,4-ethylenedioxythiophene):poly(styrenesulfonic acid) 


(PEDT:PSSH) films and sulfonated poly(thiophene-3-[2-(2-meth- 
oxyethoxy) ethoxy] -2,5-diyl):poly(4-hydroxystyrene) (S-P3MEET: 
PHOST) films, but they can introduce electrochemical effects and 
large electrical resistance, which degrade the quality of the contact’. 

Our self-compensated heavily doped polymer scheme uses a hole- or 
electron-doped polymer organic semiconductor core with charge com- 
pensation by covalently bonded counter-ions (Fig. 1a). Excess bound 
counter-ions can be used together with spectator ions to provide solu- 
tion processability. The chemical structures and designations of these 
and other materials are shown in Extended Data Fig. 1. In contrast 
to self-doped polymers, these self-compensated doped polymers are 
produced by deliberate hole (or electron) doping with strong acceptors 
(or donors), and internal ion exchange to remove the inserted counter- 
ions as soluble salts (Extended Data Fig. 2). These processes can be 
performed in either the film state or the solution state; strong dopants 
can be used in a non-aqueous medium for quantitative doping. Near- 
unity doping efficiencies can be achieved because the doping is made 
irreversible by removal of the dopant product, which is not the case 
when the dopant is incorporated into the organic matrix’. Although 
self-compensation has previously been explored for charge balancing"®, 
its potential to yield practical doped polymers with extreme work func- 
tions was not recognized. By contrast, polyelectrolytes have long been 
proposed as charge-transport layers and work-function modifiers, but 
their electrical resistance without deliberate doping is prohibitively 
high, and their carrier density, work function and charge-injection 
ability are limited*-’. Therefore, ohmic injection of holes or electrons 
into semiconductors with high ionization potential (Ip) or low electron 
affinity (E,), respectively, has not been possible. 

Through the separate doping and compensation strategy, poly- 
mers with high Ip or low E, can be heavily hole- or electron-doped, 
respectively, to reach a doping level well above 10 mol% of the repeat 
units and shift the Fermi level to the relevant band edges. This level of 
doping is different from that used for trap passivation, which requires 
only ultralow doping’**. Because the covalently bonded counter-ions 
prevent centre-of-mass diffusion, the doped carriers cannot diffuse as 
a neutral entity across the doped/undoped junction into the lower- 
energy semiconductor, despite favourable thermodynamics (Fig. 1b). 
This stabilizes the doping profile, that is, the doping density across the 
junction as a function of the depth or distance. To illustrate the stable 
doping profile achieved, the self-compensated hole-doped triarylamine- 
fluorene copolymer with tethered trifluoromethylsulfonylimide 
counter-ion (P4) was deposited on a substrate, and spin-cast with an 
overlayer film of an alkoxy-substituted poly(p-phenylenevinylene) 
(OC,C;9-PPV), which has a lower Ip than P4 (5.0 eV versus 5.8 eV). 
Optical spectroscopy demonstrates that the underlayer remains doped 
while the overlayer remains undoped (Fig. 1c). In contrast, if the coun- 
ter-ion is unbonded, quantitative transfer of doping occurs to the 
OC)Cjo-PPV overlayer film (Extended Data Fig. 3a, b). Such a loss of 
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Figure 1 | Stabilization of the doping profile by counter-ion 
immobilization in self-compensated doped polymer organic 
semiconductors. a, Schematic of self-compensated p- (top) and n-doped 
(bottom) polymer organic semiconductors. The doped polymers 

contain a high density of mobile holes or electrons introduced into their 
m-conjugated cores. The green dashed ellipses highlight the tethered 
counter-ion required for self-compensation of the mobile carrier. The 
purple dashed ellipses highlight the tethered counter-ion and its spectator 
ion (M* or X~) required for processability in solvents. b, c, Schematic (b) 
and experimental data (c) for stabilization against doping transfer from a 
self-compensated hole-doped organic semiconductor layer (OSC-1) with 
higher energy for the charge carriers to an adjacent organic semiconductor 
(OSC-2) with lower energy, during solution processing. Fermi-level 


control of the doping profile occurs whenever mobile counter-ions are 
present, even as impurities (Extended Data Fig. 3c, d). Only when all the 
counter-ions are immobilized will carrier diffusion occur solely across 
the heterojunction to produce the desired charge accumulation layer”. 

Details of synthesis, doping and self-compensation of the exemplified 
materials are given in Supplementary Methods. The doping level of 
these materials was ascertained by optical spectroscopy and/or X-ray 
photoemission spectroscopy (XPS); see Supplementary Discussion 
2.1 and 2.2. Self-compensation was verified by XPS at the film surface 
and by Fourier-transform infrared (FTIR) spectroscopy for the film 
bulk. The basic characterizations of selected materials are reported in 
Supplementary Discussion 2.2 and Extended Data Fig. 4. The vacuum 
work function was measured in the standard way by ultraviolet pho- 
toemission spectroscopy (UPS) from the Fermi level of the film. 

The work function has an electrochemical and a surface-dipole 
contribution, represented by the sum 44+ x, where ju is the electro- 
chemical potential of carriers at the Fermi level, measured from the 
Galvani potential, and x is the surface-dipole potential”°. For doped 
organic semiconductors, the ‘intrinsic component of the work func- 
tion, which is dominated by 1, is governed by the electronic structure of 
the organic semiconductor core; see ref. 21 for examples of hole-doped 
polymers and Extended Data Fig. 4f, j, k for examples of electron-doped 
polymers. However, Coulomb interactions with counter-ions and any 
spectator ions that are present also play an important part, owing to 
poor screening in these materials”!?. On the other hand, the ‘extrin- 
sic component of the work function, given by y, can be manipulated 
through surface ionic layering and dipole effects. 

As an illustration of the effects of the x-conjugated core, self- 
compensated hole-doped triarylamine-fluorene copolymers can 
provide ultrahigh work functions up to 5.8 eV. Self-compensated 
electron-doped naphthalene-bis(carboxyimide)-thiophene copol- 
ymers can provide low work functions down to 3.9 eV. These were 
the first examples of air-stable, heavily n-doped polymers that can 
be processed from solution. The use of fluorene copolymer cores can 
extend the work-function range to 3.0eV under stringent conditions. 

Doped organic semiconductor films with such extreme work func- 
tions are vulnerable to de-doping by impurities such as H2O and Oy, 
which are present even in glove boxes. If the Fermi level of the doped 
film is higher or lower than the redox potential of the O2/H2O couple, 
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(Ep) alignment results in an offset of the vacuum level (VL) due to the 
transfer of carriers at the interface. In the experiment, OSC-2 is poly(2- 
methoxy-5-(3’,7'-dimethyloctyloxy)-1,4-phenylenevinylene) (OC;Cio- 
PPV; Ip + 5.0 eV), spin-cast over a triarylamine-fluorene copolymer film 
with tethered trifluoromethylsulfonylimide counter-ion (P4; OSC-1) 

(Ip + 5.8 eV) that was p-doped to the self-compensated state (doping level 
of x 0.8). These films were probed by optical transmission spectroscopy. 
The difference spectrum (top panel, purple solid line) is obtained from the 
sample spectrum after depositing OSC-2 (bottom panel, green line) minus 
the spectrum before deposition (bottom panel, red line). The spectrum 

of OSC-2 only on glass (top panel, purple dashed line) is shown for 
comparison. No doping transfer to the OSC-2 overlayer has occurred. 


electron transfer can occur to de-dope the film. The standard electrode 
potential for this couple in neutral HO corresponds to a work function 
of 5.3 eV. However, the actual conditions during film formation can be 
very different from the standard conditions used for these potentials. 
Also, the actual mechanism may involve considerable overpotentials, 
as in one-electron oxygen reduction and water oxidation reactions 
(Extended Data Fig. 5). As a consequence, the regime above 5.3 eV— 
which is demarcated by oxidation of neutral HO under standard con- 
ditions and which we denote ‘ultrahigh work function. —and the regime 
below 4.1 eV—which is demarcated by standard reduction of O, to O."~ 
in water and which we denote ‘ultralow work function —may exhibit 
sufficient kinetic stability for device processing before encapsulation. 

The stringent removal of HO is important for the stability of films 
with both ultrahigh and ultralow work functions. H,O promotes the 
oxygen reduction cascade and autoxidation reactions at much less neg- 
ative potentials, which closes the kinetic stability window. Therefore, 
these materials need to be non-hygroscopic. This condition can be 
met practically by selecting minimally hygroscopic counter-ions 
and spectator ions (Supplementary Information 2.3). For example, 
the trifluoromethylsulfonylimide counter-ion imparts considerably 
better ambient stability to the ultrahigh-work-function hole-doped 
p-trifluoromethy]-substituted triarylamine-fluorene copolymer than 
does the more hygroscopic sulfonate ion (Extended Data Fig. 6a, b). 
Furthermore, covalent bonding of the counter-ions confers stability 
against de-doping. For example, the w-trimethylammonioalkyl counter- 
ion imparts considerably better ambient stability to the ultralow- 
work-function electron-doped naphthalenebis(carboxyimide)-thiophene 
copolymer than does unbonded Na‘ or cobaltocenium (Extended Data 
Fig. 6c, d), providing unprecedented, albeit limited, air stability to the 
self-compensated electron-doped naphthalene-bis(carboxyimide)- 
thienothiophene copolymer with tethered trimethylammonium 
counter-ion (N1; which has a vacuum work function of 4.0 eV). This 
ability of the tethered counter-ions to improve polymer stability 
is attributed to suppression of the ionic conduction mechanism that is 
needed to compensate electronic de-doping of the polymer””’. 

These design considerations enable a wide variety of self-compensated 
heavily doped polymer organic semiconductors to be made to 
exhibit ultrahigh and ultralow work functions. With these materials, 
ohmic hole and electron contacts can be achieved even for the most 
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Figure 2 | Ohmic charge-injection and charge-collection characteristics. 
a, Current-density—voltage (J-V) characteristics of hole-only diodes (ITO/ 
HIL/PFOP/Mo0Os) with a 170-nm-thick PFOP layer. 30-nm-thick p-doped 
P4 (blue line), p-doped P1 (green line) or PEDT:PSSH (orange line), 
spin-cast onto ITO before the deposition of PFOP, was used as the HIL in 
these test diodes. 10-nm-thick MoO; capped with Ag was evaporated onto 
the top of the PFOP layer to give the control ohmic hole contact. The 

J-V characteristic obtained when biased for hole injection from MoO3 

is shown for comparison (red line). b, J-V characteristics of electron- 

only diodes (ITO/PEDT:PSSCs/PFOP/EIL) with a 120-nm-thick PFOP 
layer. 50-nm-thick p-doped PEDT:PSSCs, spin-cast onto ITO before the 
deposition of PFOP, was used as the hole-blocking contact. 20-nm-thick 
N2, either undoped (green line) or n-doped by contact with sodium 
naphthalenide and capped with evaporated Ag (blue line), was used as 

the EIL, which was spin-cast onto the top of the PFOP layer. 30-nm-thick 
Ca, capped with Al, was evaporated onto the top of the PFOP layer to give 
the control ohmic electron contact (red line). c, J-V characteristics of 
bipolar diodes (ITO/PEDT:PSSH/DPPT2-TT/EIL) with a 110-nm-thick 
DPPT2-TT layer. 50-nm-thick p-doped PEDT:PSSH, spin-cast onto ITO 
before the deposition of DPPT2-TT, was used as the ohmic hole contact. 
5-nm-thick n-doped N1 capped with Ag, spin-cast onto the top of the 
DPPT2-TT layer, was used as the EIL in the test diode (blue line). 100-nm- 
thick Ag (red line) and Al (green line), evaporated onto the top of the 
DPPT2-TT layer, were used as controls. d, J-V characteristics of solar cells 
(ITO/HCL/PBDTTPD:PCBM/Ca) with a 90-nm-thick PBDTTPD:PCBM 
(at a weight ratio of 1:1.5) photoactive layer, under 100 mW cm 
illumination (spectral-mismatch corrected). 40-nm-thick p-doped 
PEDT:PSSH (red line) or 30-nm-thick p-doped P1 diluted in PHOST 

(at a weight ratio of 1:3) (blue line), spin-cast onto ITO before the 
deposition of the photoactive layer, was used as the HCL. 30-nm-thick Ca, 
evaporated onto the top of the photoactive layer, was used as the ECL. 


demanding organic semiconductor benchmark—the wide-bandgap 
poly(9,9-bis(4-octylphenyl)fluorene-2,7-diyl) (PFOP). To demonstrate 
the creation of these contacts, we use unipolar single-carrier device 
protocols, thus avoiding any ambiguity related to the confinement of 
opposite carriers. For example, p-doped P4 and MoO; are equally as 
effective at providing ohmic hole injection into PFOP (Ip ~5.8 eV) 
when used as the hole-injection layer (HIL) in the hole-only device 
ITO/HIL/PFOP/MoO3;/Ag (where ITO is indium tin oxide). An evap- 
orated 10-nm-thick MoO; film provides an authentic top ohmic hole 
contact in these devices as reference. For device fabrication details, see 
Supplementary Methods. The current density is essentially the same 
whether biased for hole injection from P4 or MoO; (Fig. 2a). The 
current-density—voltage (J-V) characteristics exhibit the ideal form 
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Figure 3 | Blue-light-emitting diode with ultrahigh- and ultralow-work- 
function charge injection layers. a, Current-density—voltage (J-V; filled 
symbols, left axis) and luminance-voltage (L-V; open symbols, right axis) 
characteristics for a 90-nm-thick PFOP light-emitting diode (ITO/HIL/ 
PFOP/EIL). 30-nm-thick p-doped P4 and 20-nm-thick n-doped N3, spin- 
cast before and after the deposition of the PFOP layer, were used as the 
HIL and EIL, respectively, in the test diode (blue symbols). A diode with 
40-nm-thick p-doped PEDT:PSSH, spin-cast before the deposition of the 
PFOP layer, and 30-nm-thick Ca, evaporated onto the PFOP layer, was 
used as control (red symbols). All diodes were capped with Al. 

b, Luminance efficiency versus voltage for the diodes. 


Jc V’) for space-charge-limited current conduction. Whereas MoO; 
has to be deposited in high vacuum, P4 can be deposited from solution, 
before or after the semiconductor layer as desired and even under ambi- 
ent conditions, which increases the flexibility of device design. When 
P4 (work function of 5.8 eV) is substituted by the self-compensated 
hole-doped triarylamine-fluorene copolymer with tethered sulfonate 
counter-ion (P1) or by PEDT:PSSH (work functions of 5.4eV and 
5.1 eV, respectively), the hole-injection efficiency is greatly reduced. 
The effective work function evaluated in situ from built-in potential 
(Vpi) measurements using Stark-effect electro-absorption spectroscopy 
is broadly consistent with the vacuum work function (Supplementary 
Discussion 2.4 and Extended Data Fig. 7). 

In parallel experiments, a self-compensated electron-doped 
benzothiazole-fluorene copolymer with tethered trimethylammonium 
counter-ion (N2; vacuum work function of 3.9 eV) is nearly as effec- 
tive as Ca at providing ohmic electron injection into PFOP (estimated 
E, 2.7 eV) when used as the electron-injection layer (EIL) in the 
electron-only device ITO/PEDT:PSSCs/PFOP/EIL/Ag (where 
PEDT:PSSCs is poly(3,4-ethylenedioxythiophene):poly(styrenesul- 
fonate cesium)). The PEDT:PSSCs layer (vacuum work function of 
4.5 eV) is selected to be hole-blocking to ensure a unipolar device. An 
N2 layer was spin-cast onto the PFOP as a putative EIL. It was left 
undoped, or n-doped with a dilute solution of sodium naphthalenide, 
and capped with evaporated Ag. Control devices were fabricated by 
evaporating Ca on PFOP. The J-V characteristics show that n-doped 
N2 and Ca, but not undoped Nz2, are able to inject electrons into the 
semiconductor (Fig. 2b). The J-V characteristics are consistent with 
trap-limited space-charge-limited current; that is, J=(V — V.,)" where 
V, & Vi is the bare potential and n = 4.8 (ref. 24). 

n-doped NI can be used with p-doped PEDT:PSSH to provide ohmic 
electron and hole contacts to low-bandgap organic semiconductors 
under ambient processing conditions. In combination with PEDT:PSSH, 
N1 is equivalent to or better than Al as an EIL on poly(2,5-bis 
(alkyl)-1,4-dioxopyrrolo[3,4-c] pyrrole-3,6-diyl-thiophene-2,5-diyl- 
thieno[2,3-b]thiophene-2,5-diyl-thiophene-2,5-diyl) (DPPT2-TT) 
in the bipolar diode structure ITO/PEDT:PSSH/DPPT2-TT/EIL/ 
Ag (Fig. 2c). DPPT2-TT is an important prototype of the low-bandgap 
DPP polymers (Ip + 5.15 eV; estimated E, + 3.5 eV)*°. n-doped 
N1 can also be an efficient bottom electron-collection layer (ECL) 
for poly(3-hexylthiophene):phenyl-C.;-butyrate methyl ester 
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Figure 4 | Facile electrode differentiation by self-aligned assembly 

of ultrahigh- or ultralow-work-function interlayers. a, Schematic of 

the FET structure. Grey, Ag gate electrode; white, 450-nm-thick PMMA 
dielectric; pink, 50-nm-thick DPPT2-T organic semiconductor; magenta 
(in test devices), doped contacts imposed by a self-assembled ultrahigh- or 
ultralow-work-function interlayer; yellow, Au source/drain electrodes; 
blue, glass. b, c, Transfer characteristics on linear (b) and semi-logarithmic 
(c) scales. A monolayer of p-doped P1 (middle) or n-doped N1 (bottom) 
was self-assembled onto the Au electrode array from dilute solution. 


(P3HT:PCBM) solar cells in the inverted architecture with PEDT:PSSH 
as top hole-collection layer (HCL) (Extended Data Fig. 8). This opens 
up the possibility of fabricating inverted organic and other solar cells 
on flexible metal foils. 

p-doped P1 outperforms the usual PEDT:PSSH as the HCL in model 
poly(5,6-dihydro-5-alkyl-4,6-dioxo-4H-thieno[3,4-c]pyrrole-1,3- 
diyl-4,8-bis(alkoxy)-benzo[1,2-b:4,5-b’]dithiophene-2,6-diyl):phenyl- 
C¢)-butyrate methyl ester (PBDTTPD:PCBM) organic solar cells of the 
form ITO/HCL/PBDTTPD:PCBM/Ca/Al, with PBDTTPD:PCBM at 
a weight ratio of 1:1.5. PBDTTPD is an example of a donor polymer 
with a higher Jp and, hence, better ambient stability than regioreg- 
ular P3HT*®. The cells with P1 outperform those with PEDT:PSSH 
(Fig. 2d). Higher fill factor (0.60 versus 0.50; standard error of +0.02, 
n= 8), open-circuit voltage (0.93 V versus 0.90 V) and power conver- 
sion efficiency (6.8% versus 5.2%) were obtained without any particular 
morphology optimization of the photoactive layer. The improvements 
in open-circuit voltage and, in particular, fill factor are characteristic 
signatures of the reduced contact resistance in highly ohmic contacts 
(J.-K.T., R.-Q.P. and P.K.H.H., unpublished results). These results 
demonstrate that, to achieve the required ohmic contacts and best 
performances from state-of-the-art photoactive layers, HCL materials 
with ultrahigh work function need to be used instead of PEDT:PSSH. 

Ina final experiment on diodes, we demonstrate solution-processed 
ohmic contacts to a model blue-light-emitting polymer diode. The 
device protocol is ITO/HIL/PFOP/EIL/AlL. A 90-nm-thick PFOP layer 
was used as the model polymer organic semiconductor, 30-nm-thick 
p-doped P4 as the HIL and 20-nm-thick self-compensated electron- 
doped fluorene polymer with tethered trimethylammonium counter- 
ion (N3; vacuum work function of 3.0eV) as the EIL, capped with Al. 
This test diode gives a larger (by about 0.3 V) V,; and a higher current 
density when plotted against V— V;;, compared to the control device 
with PEDT:PSSH as the hole contact (which is blocking) and evapo- 
rated Ca as the EIL (Fig. 3a). The test diode exhibits more balanced 
electron-hole injection, which produces a 1.5-order-of-magnitude 
improvement in electroluminescence efficiency (2-3 cd A!) over 
the control device (Fig. 3b). This, together with the demonstration of 
efficient DPPT2-TT bipolar diode, demonstrates that fully solution- 
processed p-i-n structures can be produced. The deposition of the 
electron-doped polymer overlayer is compatible with the hole-doped 
polymer underlayer. Also, the doped layers do not degrade the electro- 
luminescence properties of the sandwiched polymer organic semicon- 
ductor layer. These features are possible only because of the counter-ion 


LETTER 


T T T T T T 
10 O,-treated Au 
1 A 
0.1 4 
1 1 1 1 1 
T T 7 T T 
10 p-doped P1 : 
1 
o% Seamer 
1 i. 1 
T T T T T T 
10 n-doped N1 = [ 
1 ql [ 
01 lan, E i 1 l 
1 1 1 L 1 2 4 6 8 


40 
Gate voltage (V) 


Channel length, 100|1m; channel width, 3 mm. The drain voltage Vq 

was stepped from 0 V (red) to —20 V (magenta) in steps of —2.5 V. An 
O3-treated Au electrode array (top) was used as the control. The yellow 
shading in c marks the subthreshold region of the control device to 
emphasize the threshold shifts in the test devices. d, Schematic (top left), 
photograph (top right) and characteristics (bottom) of a complementary 
metal-insulator-semiconductor inverter. Common V,, = 0.0 V, supply 
voltage Vaq = 5.0 V; Vin, input voltage; Vou, output voltage. 


immobilization that prevents doping-profile migration and mutual 
annihilation of the opposite carriers. 

Next we demonstrate the ability of self-assembled ultrahigh- and 
ultralow-work-function polymer interlayers to differentiate regular 
metal electrodes, such as Au and Ag, into excellent hole and electron 
contacts, respectively, in field-effect transistors (FETs). The top-gate 
bottom-contact FETs tested comprise Au source-drain electrodes fab- 
ricated on a glass substrate, poly(2,5-bis(alkyl)-1,4-dioxopyrrolo[3,4-c] 
pyrrole-3,6-diyl-terthiophene-2,5”-diyl) (DPPT2-T) semiconductor, 
poly(methyl methacrylate) gate dielectric and Ag gate electrode (Fig. 4a). 
Before deposition of the semiconductor, the Au electrodes were optionally 
contacted with a solution of p-doped P1 or n-doped N1 to self-assemble 
a doped-polymer monolayer that is a few nanometres thick. This 
assembly occurs over only the patterned metal electrodes, but not the 
channel, owing to image-charge electrostatic attraction. 

The control FET with bare Au source-drain electrodes (ozone- 
cleaned) exhibits ambipolar behaviour with field-effect mobilities of 
1.0cm?V~!s~1 and 0.5cm?V~'s~? for holes and electrons, respec- 
tively. Insertion of P1 suppresses n-channel behaviour through a 10-V 
upshift of the gate-source V,, thresholds, whereas insertion of N1 sup- 
presses p-channel behaviour by a 15-V downshift of the V,, thresholds 
(Fig. 4b, c). Such large shifts in both directions cannot be obtained with 
the usual thiol self-assembled monolayer treatment of Au”’. Although 
FETs are less sensitive than diodes to their contacts, owing to carrier 
accumulation in their channels”°, switching the work function across 
a 1.5-eV-wide range here is sufficient to convert the electrode from 
a good hole contact to a good electron contact, thereby differentiat- 
ing ambipolar FETs into p- FETs and n-FETs. This switching is also 
observed for Ag electrodes, despite their intrinsically better electron- 
injection properties (Extended Data Fig. 9). It is thus a general approach 
to contact modification. This ability to manipulate the work function 
of metal electrodes will help to realize optimal contacts also to two- 
dimensional semiconductors”. 

The differentiated p-FET and n-FET can be combined to give an 
inverter—the most basic element of a complementary metal—insulator- 
semiconductor circuit (Fig. 4d). The device shows clean inversion 
operation, no hysteresis, and a midpoint voltage of 5 V that is slightly 
displaced by asymmetry effects. Other possible applications include 
vertical transistors and light-emitting transistors”’. 

To alleviate the stringent atmospheric control that is needed to 
realize n-doped films with work functions of less than about 3.5 eV, 
the self-compensation strategy can be combined with precursor 
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n-dopants to generate the electron-doped state in the final device. As 
an illustration, a composite film of (4-(1,3-dimethyl-2,3-dihydro-1H- 
benzoimidazoyl-2-yl)phenyl)dimethylamine (DMBI-H) dispersed 
in a naphthalene-bis (carboxyimide)-thiophene core with teth- 
ered trimethylammonium counter-ion (N4) yields strong electron- 
doping of the naphthalenebis(carboxyimide)-thiophene core when 
warmed (doping level of 1.0 electrons per repeat unit), whereas 
the corresponding composite film with the same naphthalene-bis 
(carboxyimide)-thiophene core but without the tethered counter-ion 
(NO) yields only weak doping (0.1 electrons per repeat unit) under 
identical conditions (Supplementary Methods and Extended Data 
Fig. 10). This reaffirms the role of the immobilized counter-ions in 
stabilizing doped states with extreme work functions. 

Finally, we speculate about whether x plays any part in the effec- 
tive work function of these doped polymer heterojunctions where the 
polaron carriers co-exist with a high density of spectator ions. The Vij 
measured in some diodes suggests that the effective work function can 
be favourably shifted by up to a few tenths of an electronvolt compared 
to the vacuum work function. For example, the n-doped N1 film ina 
DPPT2-TT diode (Fig 2c) exhibits an apparent decrease in effective 
work function by 0.3 eV, while a p-doped para-methoxy-substituted 
triarylamine-fluorene copolymer film in a PFOP diode exhibits an 
apparent increase in effective work function by 0.2 eV. These shifts 
point to a build-up of a favourable interfacial dipole at the hetero- 
junction, perhaps by an angstr6m-scale displacement of the tethered 
counter-ions that, surprisingly, reinforces the extreme work function 
that is sought after in these films. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Extended Data Figure 1 | Chemical structures of materials used. The value of x denotes the doping level, in carriers per repeat unit (r.u.), with x =0 
denoting an undoped polymer and x= 1 a fully doped polymer. 
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Extended Data Figure 2 | Schematic of the preparation of self- polymer as a solid, which can be dissolved in the desired solvent. Bottom, 
compensated heavily doped polymer organic semiconductors. Top, film-state doping method. The undoped polymer film is deposited on the 
solution-state doping method. The undoped polymer dissolved in a desired substrate, doped by contact with a dopant solution, and purified by 
suitable solvent is doped in solution by the desired dopant, and purified repeated contact with a non-solvent that dissolves the unwanted ions. This 


by repeated precipitation with a non-solvent while keeping the unwanted directly yields the fully self-compensated doped polymer film. 
ions in solution. This process yields the fully self-compensated doped 
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Extended Data Figure 3 | Doping-profile migration in the presence semiconductor layer (OSC-2) during solution processing. OSC-1 is 
of mobile counter-ions. a, b, Schematic (a) and experimental data (b) incompletely self-compensated hole-doped P1 (x ~ 0.8) containing 50% 
for transfer of holes and counter-ions from a hole-doped organic residual Na* and SbF. OSC-2 as in a and b. Nearly complete transfer of 
semiconductor layer with mobile counter-anions (OSC-1) to an adjacent holes and counter-anions from the doped underlayer to the OC;C,9-PPV 
organic semiconductor layer (OSC-2) during solution processing. OSC-1 overlayer occurred during spin-casting, as evidenced by emergence of 
is PO (Ip=5.9 eV), hole-doped to x + 0.8, with SbF,” as counter-ion. the polaron band of OC;Cj9-PPV (600-800 nm) and bleaching of the 
OSC-2 is OC{Cio-PPV (Ip=5.0 eV), deposited by spin-casting on polaron band of PO and P1 (>900 nm). This doping transfer is driven by 
p-doped PO. c, d, Schematic (c) and experimental data (d) for transfer thermodynamics from a high-Ip layer to a low-Ip layer. E,, Fermi energy; 
of holes and counter-ions from an incompletely self-compensated VL, vacuum level. 


hole-doped organic semiconductor (OSC-1) to an adjacent organic 
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Extended Data Figure 4 | Characterization of selected self- 
compensated, heavily doped polymers. a-d, Triarylamine-fluorene 
copolymer P1. a, Optical spectra of doped P1 in solution (top), and of 

a P1 film before and after doping (bottom). For solution-state doping, 
0.12 mM r.u. P1 in propylene carbonate was doped with NOSbF, and 
self-compensated, yielding a final doping level of 0.5 holes per r.u. The 
optical spectrum is collected in a cell with a 2.0-mm path length. For film- 
state doping, a 100-nm-thick P1 film was doped with NOSbF, and self- 
compensated, yielding a final doping level of 0.9 holes per r.u. b, c, UPS 
(b) and XPS spectra (c; Nls and F1s) of undoped and doped P1 (x + 0.5) 
films. d, FTIR evidence for full self-compensation in a P1 film. Spectra 
were acquired before doping (red), after film-state doping with 6 mM 
NOSDF, in acetonitrile (green) and after self-compensation by excess ion 
removal (Na* and SbF¢_) via spin-rinse with acetonitrile (blue). Infrared 
active vibration modes (IRAV; 1,100-1,600 cm!) indicate that the film 
is doped. SbF,” (anti-symmetric stretching frequency 14, = 660 cm!) 

is present before, but not after, self-compensation (residual, <1%). 
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e-g, Naphthalenetetracarboxydiimide-thiophene copolymer N1. e, Optical 
spectra of doped N1 in solution (top), and of an N1 film before and 

after doping (bottom). For, solution-state doping, 0.22 mM r.u. N1 in 
dimethylsulfoxide was doped with sodium 9,10-diphenylanthracenide 

and self-compensated, yielding a final doping level of 1.0 electrons 

per r.u. The optical spectrum is collected in a cell with a 2.0-mm path 
length. For film-state doping, a 25-nm-thick N1 film was doped with 
sodium 9,10-diphenylanthracenide and self-compensated, yielding a 

final doping level of 1.0 electrons per r.u. f, g, UPS (f) and XPS spectra 

(g; N1s and I3d5,2) of undoped and doped N1 (x ~ 1.0) films. h-k, Fluorene 
copolymers N2 and N3. h, i, Optical spectra of 40-nm-thick N2 (h) and 
N3 (i) films, before and after film-state doping with sodium naphthalenide 
solution and spin-rinsing with tetrahydrofuran. Final doping levels, 0.9 
electrons per r.u. (N2; h) and 0.4 electrons per r.u. (N3; i). j, k, UPS spectra 
of undoped and n-doped N2 (j) and N3 (k) films. All binding energies 

are referenced to vacuum level. The Fermi energy Er is marked for doped 
films; the ionization potential Ip is marked for undoped films. 
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Extended Data Figure 5 | Electrode potentials of H,O and O2 redox 
couples in pH 7 water, plotted on the vacuum scale. Conditions: neutral 
H,O; pressure, 1 bar (Oz, H2); concentration, 1 M (H,O3, O°", HO’, 
HO,"). The reference voltage (0.00 V) on the standard hydrogen electrode 
(SHE) scale corresponds to 4.46 eV on the vacuum scale. Electrode 
potentials of O2/O2"~ were computed from ref. 31; other electrode 
potentials were computed from ref. 32. p-doped films with ultrahigh 
work functions (UHWE) of greater than about 5.3 eV and n-doped films 
with low work functions (LWF) of less than about 5.3 eV can oxidize 
neutral H2O to Oz, and reduce O; to neutral H20, respectively, and so 
cannot remain indefinitely stable in the presence of these species. If the 
oxygen reduction reaction proceeds through one-electron steps, then 


the relevant first one-electron couple (O2/O,*~ ) lies in the range 
4.15-3.65 eV, depending on the dielectric medium, which opens up a 
stability window for films with ultralow work functions (ULWF). The 
stability of the films with ultralow work functions is ultimately limited by 
the electrode potential at the minimum practical O2 concentration during 
film preparation (p.p.m. range). To overcome this limitation, precursor 
n-dopants can be incorporated into the self-compensated platform (see 
main text). Conversely, the relevant first one-electron couple for water 
oxidation ((H* + HO*)/H,0) lies at 6.75 eV, which opens up a stability 
window for films with ultrahigh work functions. DMSO, dimethyl 
sulfoxide; BMIM BF,4, 1-butyl-3-methylimidazolium tetrafluoroborate. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


+¥) 


0.15 pristine P4 


p-doped P4 after 
exposure to 
ambient (min): 0 


15 


45 


-log (Transmittance) 
(o>) 
S 


0.05 


ati tii 414 


800 900 


400 500 600 700 
Wavelength (nm) 


c 

0.08 pristine N1 4 
= n-doped N1 after ] 
® exposure to | 
c ambient (min): 0 | 
£0.07 1 
€ 16 | 
2 64 | 
© 0.06 130 1 
S 280 | 
2 | 
T 0.05 1 

00 


400 500 600 700 800 900 


Wavelength (nm) 


Extended Data Figure 6 | Ambient stability of ultrahigh- and ultralow- 
work-function films. a, b, Self-compensated p-doped P4. a, Time- 
dependent optical spectra of a 12-nm-thick film in ambient conditions 
(295K, 65% relative humidity (RH)). The initial doping level was 

x 0.8. b, To compare the stability imparted by different counter-anions, 
the doping level is shown as a function of exposure time for P4 (with 
tethered trifluoromethylsulfonylimide counter-anion; blue) and P3 (with 
tethered sulfonate counter-anion; red). The films were hole-doped with 
1mM NOSDF, in acetonitrile and self-compensated by spin-rinse with 
acetonitrile in a N2 glove box, and then exposed to ambient conditions and 
measured after specified intervals. The doping level was evaluated from 
the integrated polaron band intensity (460-540 nm). Time to 50% doping 
level: P4, 50h; P3, 15 min from initial doping level of about 0.8 holes per 
ru. ¢c, d, Self-compensated n-doped N1. c, Time-dependent optical spectra 
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of a 7.5-nm-thick self-compensated film in ambient conditions. The initial 
doping level was x~ 1.0. d, To compare the stability of self-compensated 
and non-self-compensated films, the doping level is shown as a function 
of exposure time for N1 (with tethered trimethylammonium counter- 

ion; blue) and NO with unbonded Na‘ (red) or Cc* (cobaltocenium; 
green) counter-ions. The films were electron doped with 15 mM sodium 
9,10-diphenylanthracenide in diglyme and self-compensated by spin-rinse 
with acetonitrile. The doping level was evaluated from the integrated band 
intensity (540-700 nm) after calibration. Time to 50% doping level: 

N1, 2h; NO counter-balanced by Cc*, 20 min; NO counter-balanced by 
Na‘, 4min from initial doping level of about 1.0 electrons per r.u. Effective 
ionic radius rion: Na*, 1.0 A; Cc*, 3. A; —(CH2)3NMe3", 3.2 A. De-doping 
recovers the original y-17* spectrum. 
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Extended Data Figure 7 | Apparent work function evaluated by in situ 
built-in potential measurements. a, Stark-effect electro-absorption 
spectra of hole-only diodes with p-doped PEDT:PSSH (left), P2 (middle) 
and P4 (right) as hole contacts, PFOP as the organic semiconductor and Al 
as the electron-blocking contact. Normalized modulated reflection spectra 
(AR/R) are shown for different d.c. biases Vj. applied to the hole contact, 
where AR is the modulated root-mean-square reflectance in-phase with 
the forward-bias half-cycle and R is the d.c. reflectance*’. Negative AR/R 
therefore indicates induced absorption in-phase with the forward-bias 
half-cycle. The dotted line gives the null background. Temperature, 30 K; 
modulation frequency, 535 Hz; root-mean-square voltage Vim, 0.5 V. 

b, Apparent energy-level alignment diagrams deduced from the built-in 
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potential V;j;, assuming rigid alignment at the electron contact. Energies 
are plotted relative to the vacuum level (VL) of the organic semiconductor. 
The Stark band at 2.87 eV near the x—-1* absorption edge of PFOP shows 
polarity inversion at 1.70 V for PEDT:PSSH, at 2.35 V for P2 and at 2.4V 
for P4. Estimated uncertainty, +0.03 V. Consequently, the apparent work 
functions are: PEDT:PSSH, 3.4+ 1.7 =5.1 eV; P2, 3.4+ 2.35=5.75eV; 
P4, 3.4+2.4=5.8eV. In reality, the effective work function at the electron 
contact decreases in response to an increase in work function at the hole 
contact, driving it into the pinned regime. This then gives rise to pinning 
at about 5.6 eV. The apparent work functions therefore confirm that the 
work functions of P2 and P4 are both larger than 5.6 eV, with the work 
function of P4 greater than that of P2. 
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Extended Data Figure 8 | Characteristics of solar cells with an 
ultralow-work-function polymer as the bottom electron-collection 
contact. Current density versus voltage for an 80-nm-thick P3HT:PCBM 
photoactive layer (weight ratio of 1: 0.8) under 100 mW cm *? illumination 
(spectral-mismatch corrected) with an ultralow-work-function polymer 
as the bottom electron-collection contact and PEDT:PSSH as the top 
hole-collection contact. 20-nm-thick n-doped N1 was deposited as the 
ECL via spin-coating from DMSO solution onto the ITO layer. The 
photoactive layer (PAL) was deposited from chlorobenzene. 30-nm-thick 
PEDT:PSSH was deposited as the HCL via spin-coating from aqueous 
solution onto the top of the PAL and capped with evaporated Ag. A control 
cell was fabricated in the same batch by omitting the NI layer, so that ITO 
provided the electron-collection contact. The control cell (red) shows poor 
behaviour with an S-shaped J-V characteristic, whereas the test cell (blue) 
gives good performance similar to standard cells with evaporated Ca as the 
electron-collection contact. Open-circuit voltage V,, is given in V; short- 
circuit current J,. in mA cm’, and power conversion efficiency PCE in %. 
FE, fill factor. Both cells have been corrected for shunt leakage. 
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a, Schematic of the FET structure. b, c, Transfer characteristics on a the expected ambipolar behaviour*’, whereas the modified electrodes 
linear (b) and semi-logarithmic (c) scales. A monolayer of p-doped P2 or show gate-source Vj, threshold shifts, similar to that observed on Au 
n-doped N1 was self-assembled onto the Ag electrode array from dilute electrodes. 
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Extended Data Figure 10 | Integration of precursor n-dopant 
methodology into self-compensated doped polymers. a, b, N4 (a) and 
NO (b) with (blue) and without (red) DMBI-H. Pristine polymer films 
(15-nm-thick N4 and 20-nm-thick NO) and composite films with DMBI-H 
(1:2 mol/mol r.u.) were spin-cast and baked at 120°C on a hotplate, 

and then measured by transmission optical spectroscopy. DMBI-H 

is (4-(1,3-dimethyl-2,3-dihydro-1H-benzoimidazoyl-2-yl)phenyl) 
dimethylamine, a precursor n-dopant*®. c, Authentic n-doping of N4 films, 
as a reference. A 30-nm-thick N4 film was spin-cast and then electron- 
doped to 1.0 electron per r.u. by contact with cobaltocene (Cc; 2mM in 
diglyme). NO has an identical x-conjugated organic semiconductor core to 
N4, but lacks tethered counter-ions. Whereas N4 exhibits strong doping, 
NO exhibits only weak doping under identical conditions, possibly owing 
to the lower stability of its n-doped state. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


LETTER 


doi:10.1038/nature20135 


The evolving quality of frictional contact 


with graphene 


Suzhi Li!*?, Qunyang Li*°, Robert W. Carpick®, Peter Gumbsch”*, Xin Z. Liu®, Xiangdong Ding!, Jun Sun)? & Ju Lib?? 


Graphite and other lamellar materials are used as dry lubricants 
for macroscale metallic sliding components and high-pressure 
contacts. It has been shown experimentally that monolayer 
graphene exhibits higher friction than multilayer graphene and 
graphite, and that this friction increases with continued sliding, 
but the mechanism behind this remains subject to debate. It has 
long been conjectured that the true contact area between two 
rough bodies controls interfacial friction’. The true contact 
area, defined for example by the number of atoms within the 
range of interatomic forces, is difficult to visualize directly but 
characterizes the quantity of contact. However, there is emerging 
evidence that, for a given pair of materials, the quality of the 
contact can change, and that this can also strongly affect interfacial 
friction’~’. Recently, it has been found that the frictional behaviour 
of two-dimensional materials exhibits traits* '> unlike those of 
conventional bulk materials. This includes the abovementioned 
finding that for few-layer two-dimensional materials the static 
friction force gradually strengthens for a few initial atomic periods 
before reaching a constant value. Such transient behaviour, and the 
associated enhancement of steady-state friction, diminishes as the 
number of two-dimensional layers increases, and was observed 
only when the two-dimensional material was loosely adhering 
to a substrate®. This layer-dependent transient phenomenon has 
not been captured by any simulations!*"’. Here, using atomistic 
simulations, we reproduce the experimental observations of layer- 
dependent friction and transient frictional strengthening on 
graphene. Atomic force analysis reveals that the evolution of static 
friction is a manifestation of the natural tendency for thinner and 
less-constrained graphene to re-adjust its configuration as a direct 
consequence of its greater flexibility. That is, the tip atoms become 
more strongly pinned, and show greater synchrony in their stick- 
slip behaviour. While the quantity of atomic-scale contacts (true 
contact area) evolves, the quality (in this case, the local pinning 
state of individual atoms and the overall commensurability) also 
evolves in frictional sliding on graphene. Moreover, the effects can 
be tuned by pre-wrinkling. The evolving contact quality is critical 
for explaining the time-dependent friction of configurationally 
flexible interfaces. 

We conducted molecular dynamics simulations by sliding a silicon 
tip over graphene supported on an amorphous Si (a-Si) substrate 
(see Methods). Substrate-graphene adhesion was modelled as a van der 
Waals interaction with an effective work of adhesion of approximately 
0.1J m7’, based on experimental values!®. Figure 1a shows the relaxed 
system consisting of a-Si and a monolayer of graphene at 300 K. As 
in experiments’®, the modelled surface morphology of graphene 
conformed well to the a-Si substrate (Fig. 1b and c). A rigid round 


(001)-oriented crystalline silicon tip with a radius of 16.3 nm was 
placed in contact with graphene (Fig. 1d) and equilibrated for 
2ns (Fig. le and f). Graphene’s high out-of-plane flexibility coupled 
with tip-graphene adhesion causes local puckering near the contact 
edge (Fig. le, arrow). A similar puckering phenomenon was also 
found for the multilayer graphene systems (see Supplementary 
Discussion 1). 

We then applied an external normal load of 0.8 nN to the tip and 
further relaxed the whole system. After relaxation, we displaced a 
harmonic spring laterally coupled to the tip at 2m s~'. It is noted that the 
normal load was stably maintained at 0.8 nN during the friction simula- 
tion (see Supplementary Discussion 2). As the tip slid on graphene (with 
layer numbers N= 1-4), clear stick-slip motion with two distinct stages 
was observed in all cases (Fig. 1g), in contrast to conventional atomic 
stick-slip on bulk graphite”. Initially, the local peaks of the lateral force 
(that is, the static friction force, when slip initiates) increased progres- 
sively with each slip. After a few periods, the behaviour became regular, 
and the peak forces remained constant. The slip distance in each stick- 
slip event is approximately 2.5 A, equivalent to the lattice spacing of 
graphene. The simulations consistently reproduce, for the first time, the 
two-stage friction behaviour observed in experiments’. 

Our simulations also show layer-dependent friction®, that is, the 
strengthening effect became weaker and the frictional force reduced 
as sample thickness increased; the strengthening almost vanished 
once the thickness reached four layers. To check that two-stage fric- 
tion was not a dynamic effect caused by the high sliding speed, we first 
reduced the speed to 1m s~‘ and 0.5m s~‘ and found that there is only 
a weak dependence of the frictional strengthening on sliding speed (see 
Supplementary Discussion 3). In addition, we performed quasi-static 
calculations~’* and found that the two-stage friction trait was retained 
(see Supplementary Discussion 4). Finally, we computed the energy 
barriers for stick-slip motion, and confirmed that the stick-slip event is 
not driven primarily by thermal fluctuations at experimental time and 
force scales, but primarily by the external forcing (see Supplementary 
Discussion 5). 

Early studies attributed the friction enhancement for thinner two- 
dimensional samples to puckering*'*’, where the sliding tip induces 
more out-of-plane deformation for thinner samples, leading to a larger 
true contact area and thereby a larger friction. Here, we also observed 
noticeable puckering (Fig. le, Supplementary Video) and found that 
the puckered configuration evolved as the tip moved forward. Figure 1h 
shows the average friction force together with the contact area in the 
constant static friction force regime (that is, beyond the initial strength- 
ening regime) for different layered graphene samples. The simulations 
show that a larger contact area corresponded to larger friction, and was 
correlated with increased puckering. However, the friction increase 
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Figure 1 | Model setup and frictional behaviour for a Si tip sliding over 
a graphene/a-Si substrate system at 300 K. a, Graphene adhering to the 
a-Si substrate. b, c, Surface morphology of monolayer graphene (b) and 
the substrate (c). d, A rigid Si tip is placed in contact with graphene for the 
friction tests. e, f, Morphology of monolayer graphene (e) and substrate (f) 
in the presence of the tip after 2 ns. The lateral dimensions of the substrate 
and graphene are 43.4nm x 43.4nm and 38.5nm x 38.6 nm, respectively. 
g, Force traces showing stick-slip behaviour on single (1L) and multilayer 


(an 80% increase for monolayer versus four-layer graphene, 
consistent with experiments) was substantially larger than the change 
in contact area (14%). Therefore, the true contact area (which by 
definition is geometric and additive) change caused by the puckering 
of graphene alone cannot account for the large increase in friction. This 
strongly suggests the existence of additional mechanisms contributing 
to the layer dependence of friction and the transient strengthening. 

To trace the physical processes at play, we analysed the distribution 
of atomistic interaction forces at the interface. For the tip, the net 
lateral interaction force from graphene, Finteract, is the sum of 
the x components of all atomic forces exerted by graphene, Finteract= 
pr ai =, (f, spr) —f,(0)) , where f,(xspr) is the x component 
of the graphene-tip interaction force for tip atom i when the spring 
moves a distance x,p, and f,(0) is the interaction force before sliding. 
To highlight the resultant forces due to sliding, we considered only 
the incremental interfacial forces with respect to the initial state (see 
Supplementary Discussion 6). 

We first performed interfacial force analysis for monolayer graphene. 
Figure 2a—d shows the f ical distributions at four moments, each 


when the lateral force reached its local peak values during the stick-slip 
friction, marked i, ii, iii and iv in Fig. 1g. Owing to the crystallographic 
and geometrical differences between graphene and silicon, f ae 
values at all four moments exhibited a seemingly random distribution; the 
total sum in each case was negative (thus producing frictional resistance). 
The f i distribution clearly evolved from points i to iii (Fig. 2a—c) and 
was almost unchanged between points iii and iv (Fig. 2c and d). 

We individually followed f ‘apie ofa few tip atoms that were acting as 
pinning sites or ‘traps’ (see Supplementary Discussion 7). For some of 


these atoms, the absolute value of f ae increased in the strengthening 
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O mmm 8.0A 


Layer number 


(2L-4L) graphene/a-Si substrates. h, Variations of averaged friction (black) 
and contact area (grey) with number of layers of graphene. The black, red 
and gold atoms in a and d refer to the a-Si substrate, graphene and the tip, 
respectively; atoms in b, c, e and f are coloured according to the height 
amplitude along the y direction. The tip-graphene contact area is taken 

to be ms, where m is the number of graphene atoms that are in intimate 
contact with the tip atoms and s (2.77 A? per atom) is the atomic area of 
graphene (see Methods for details). 


stage and then saturated. This indicates that pinning gets progressively 
stronger at these sites with each slip. These atoms apparently provide a 
substantial contribution to frictional strengthening. For most atoms, 
the variation of f'"“°" was steady, indicating no pinning enhancement. 
These atoms contribute to the overall friction, but not to the initial 
transient strengthening by local trap deepening. 

Further inspection shows that the variation of the interfacial force 
is closely related to the local configuration of the contacting graphene. 
If the environment near a tip atom allows the graphene to locally 
adjust its atomic configuration, the interfacial force will exhibit an 
evolution effect. However, the detailed local contact environment varies 
from atom to atom owing to complex tip, graphene and rough substrate 
configurations. We cannot tell beforehand how an individual tip atom 
and its surrounding would evolve, because they not only depend on the 
initial configuration but also depend on the randomly rough surface and 
the complicated deformation history (for example, the statistical features 
of the final configuration depend on the entire sliding path, and not 
just on the initial and final tip locations). Nevertheless, our simulations 
clearly demonstrate that the interface tends to evolve towards a more 
commensurate (that is, with more atomic alignment at the interface) and 
better-pinned state (that is, it finds a deeper energy trap). 

To quantify this, we plotted the histogram of the atomic interac- 
tion forces at these four moments in Fig. 2e. All four histograms have 
larger populations at low-magnitude interaction forces and much lower 
populations at high magnitude. However, these histograms were all 
skewed negatively because the overall net force was negative. More 
importantly, as indicated by the insets of Fig. 2e, the range of the 
distribution increased appreciably from points i to iv (for example, the 
maximum forces are around +0.6 nN at point i, but increase to +0.8nN 
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Figure 2 | Evolution of the atomic-level forces contributing to friction 
on a monolayer graphene/a-Si substrate. a—d, Corresponding f a 
distribution at the four moments marked in Fig. 1g. The sites with positive 
magnitude of f ae (red colour) are the local pushing points that help the 
tip slip forward (to the right), while the sites with negative magnitude of 
f Ne (blue colour) are the local pinning points that produce lateral 
resistance. e, The histogram of ft (7) at each of the four moments. 


at points iii and iv). This local force enhancement is consistent with the 
higher contrast of interfacial forces shown in Fig. 2a—d, which confirms 
the abovementioned local trap deepening. In addition, geometrical 
analysis also suggests that the tip-graphene interface evolved through 
small atomic shifts to produce more intimate atomic contact as the tip 
was displaced forward, providing more atomic pinning sites for lateral 
sliding (see Supplementary Discussion 8). 

We further analysed the evolving commensurability of the interface. 
A less structured contact interface should have a unimodal force 
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Figure 3 | Simulations of stick-slip friction on monolayer suspended 
graphene at 300K. a, Simulation model. Three graphene samples 

with different amounts of pre-existing wrinkles are generated, named 
G1, G2 and G3. The colours represent the out-of-plane height amplitude. 


The histogram is obtained by dividing the range of interfacial forces into 
twenty bins and plotting the number of tip atoms with interaction forces 
within each bin. The insets provide a magnified view of the distribution 
in the tails. f, The kurtosis value for the four force distributions at the 
four moments. The inset shows the force distributions that were used to 
calculate kurtosis. p(n) is the normalized atom number in each 
corresponding bin of interfacial forces. 


distribution, similar to a Gaussian or even a uniform distribution. 
A very effective parameter with which to describe the deviation from 
a unimodal distribution (see Supplementary Discussion 9) is the 
kurtosis K = 1,/0+, where j14 is the fourth moment about the mean 
value and a is the standard deviation of the distribution (see Methods). 
The larger K, the more structured the distribution becomes, and the 
more commensurate the interface. Figure 2f show the kurtosis values 
for points i to iv. The kurtosis increased from point i to point iii and 
was almost unchanged between points iii and iv. This provides an 
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The induced peak-to-peak height undulations of G2 and G3 are 4.9 A and 
9.0 A, respectively. b, Lateral force traces showing friction sliding on the 
three samples. c, Variation of tip~graphene contact area during sliding. 
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Figure 4 | Evolution of the atomic-level forces contributing to friction 
on monolayer suspended graphene. a-c, Comparison of f inichion 
distribution in the three samples of Fig. 3b. d, The histograms of f fiction (4) 
for the three samples. The insets provide a magnified view of the 


indication that the tip-graphene interface became slightly more 
commensurate with each slip initially, and finally reached a stable value. 

The simulations unambiguously show that the friction strengthening 
originates from two key mechanisms: enhancement of the local 
pinning (some individual atoms becoming more strongly pinned), 
and an increase in the interfacial commensurability (an increase in 
the number of atoms that are pinned in perfect synchrony). This is, to 
some extent, consistent with simulations of three-dimensional mate- 
rials showing that interfacial commensurability, either through lattice 
matching”? or through interaction-enhancing contaminant species, 
greatly increases static friction”!. However, the evolution of interfacial 
friction and contact quality here is a result of high configurational free- 
dom of graphene owing to its bending flexibility. 

We then performed similar force analysis for multilayer (N = 2-4) 
graphene systems (see Supplementary Discussion 10) and found that 
the variation of the interfacial pinning and overall commensurability 
also contributed greatly to the layer dependence of friction, explaining 
the large quantitative disparity between the friction enhancement and 
true contact area increase in Fig. 1h. However, in the multilayer cases, 
the top graphene layer forms a perfect AB stacking with respect to the 
lower graphene layer. As observed in several experiments”, perfectly 
aligned stacking can strongly constrain the relative lateral movement 
of graphene layers. This enhanced constraint reduces the ability of 
graphene to adjust its configuration during the sliding process. 

Since the contact interface with graphene can be affected by pre- 
existing wrinkles’, we speculate that regulating the degree of wrinkling” 
may offer an effective means of tuning friction. To test this, we studied 
a model system by sliding a tip over freely suspended graphene with 
various degrees of pre-existing wrinkling (Fig. 3a), formed by applying 
different amounts of bi-axial compressive strain (€,, and ¢,,). Three 
samples G1, G2 and G3 with 0%, —0.2%, and —0.5% pre-compressive 
strain were studied in the simulations. As expected, the relaxation state 
of graphene profoundly affects its friction (Fig. 3b). Compared to G1, 
relaxed samples G2 and G3 exhibited substantially enhanced static 
friction (up to about 300%), although the increase in true contact 
area was much smaller (up to about 1.5%, see Fig. 3c). The ff eachon 
distributions at the local force peaks of these three samples (Fig. 4a-c) 
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distribution in the tails. e, The kurtosis value of the distributions of 

f ‘ction (7) for the three samples. The inset shows the force distributions 
that were used to calculate the kurtosis. p(n) is the normalized atom 
number in each corresponding bin of interfacial forces. 


and their histograms (Fig. 4d) indicate that, as pre-compressive strain 
increased, stronger pinning sites with higher local interfacial forces 
appeared. Furthermore, the kurtosis analysis shows that the interfacial 
force distribution increasingly deviates from a unimodel Gaussian 
distribution from G1 to G3 (Fig. 4e), indicating a more commensurate 
interface with increasing pre-compressive strain. 

As confirmed by the simulations, friction on suspended graphene 
can be effectively tuned by pre-compression and the ‘quality’ of the 
interfacial contact. But unlike the supported cases, the interface reached 
its steady configuration immediately after the tip made contact, because 
of larger excess area and resultant higher configurational flexibility. 
Thus, no frictional strengthening was observed in these simulations, 
consistent with the observations of Deng et al.''. We also performed 
simulations for graphene strongly bound to a flat surface. Owing to 
the much reduced freedom for graphene to configurationally evolve, 
no frictional strengthening or layer dependence occurred in this case 
either (see Supplementary Discussion 11), consistent with experimental 
measurements of strongly bound graphene*™*, 

In conclusion, we have identified a complementary aspect to the 
concept of true contact area governing friction on two-dimensional 
materials, related to evolving configurational relaxations that exploit 
out-of-plane floppiness. There is a general tendency for configuration- 
ally flexible systems to attain progressively deeper energy traps despite 
mechanical work being done to the system, improving the quality of 
contact. Imposing pre-compression on suspended graphene (to pro- 
duce wrinkles) increased the total friction force several times with little 
change in the true contact area. This suggests a means of controlling 
friction of two-dimensional materials via strain engineering. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Atomistic simulations. In the molecular dynamics simulations, we primarily 
studied the stick-slip friction of a tip sliding on two graphene systems: multilayer 
graphene supported on a rough amorphous silicon (a-Si) substrate; and suspended 
monolayer graphene. The covalent bonds of C-C in graphene”? and Si-Si in amor- 
phous substrate and in the crystalline Si tip were described using the Tersoff and 
Stillinger-Weber potentials, respectively. A typical 6-12 Lennard-Jones potential 
was employed to describe van der Waals adhesive interaction between graphene 
and the substrate, graphene and the tip, and between the graphene layers. The 
Lennard-Jones parameters were chosen such that the work of adhesion (Eq) or 
the pull-off force (faa) obtained by the molecular dynamics calculation were at 
the same scale as those from experimental measurements!>!®”-”°, as shown in 
Supplementary Table 1. The simulations were performed at 300 K using a Nosé- 
Hoover thermostat*? with the LAMMPS* code. The atomic configurations are 
displayed using AtomEye*’. 

For the graphene/a-Si substrate system, we first created an a-Si substrate by 
quenching liquid silicon from high temperature to 300 K using a cooling rate of 
10° Ks". The lateral dimension of the substrate was 43.4nm x 43.4nm (x-z 
plane) with a thickness of approximately 3.0nm. The atoms in the lowest 0.5nm 
were held fixed. Owing to the free boundary condition along the out-of-plane (y) 
direction, roughness naturally formed on the free surface. A graphene sample, 
38.5nm x 38.6nm in size was then placed above the a-Si within the adhesive inter- 
action distance, and was then allowed to relax by approaching the substrate in 
response to adhesion. The root-mean-square roughness for the a-Si substrate and 
the relaxed graphene is 1.5 A and 1.0, respectively, close to experimental values*?. 
After relaxation, a rigid (001)-oriented crystalline silicon tip in a bowl-shape with 
a radius of 16.3 nm is placed in contact with the graphene. 

The rigid tip assumption was based on the following considerations. First, previ- 
ous experiments® have demonstrated that the layer-dependent friction on graphene 
was reproducible regardless of the tip material (that is, silicon, silicon nitride, or 
diamond). Second, using a rigid tip that is non-deformable makes it easier to trace 
the important yet subtle evolution of the interfacial forces. 

To incorporate the compliance of atomic force microscope cantilever in 
experiments, we coupled harmonic springs in the x direction for pulling the tip 
and in the z direction for applying a fixed normal force. The stiffness of the normal 
springs is 30.0N m~! and 0.16N m |, comparable to the previous experiments®. 
The entire system was relaxed for 2 ns after adding the tip and further relaxed after 
the normal load was imposed. The friction tests were performed by displacing the 
lateral spring along x direction with a constant velocity of 2m s_! with a normal 
load of 0.8nN applied, and calculating the lateral force acting on the virtual atom. 
To enhance the damping of oscillation (primarily along the lateral direction), we 
artificially decreased the tip mass by a factor of ten (see Supplementary Discussion 
12). Simulations were performed at 300 K. Besides the rigid tip and the fixed atoms 
at the boundaries, all the other atoms in graphene and a-Si substrate were subjected 
to the thermostat. To further justify the rigid tip assumption, we carried out another 
simulation using a silicon tip sliding on monolayer graphene/a-Si substrate where 
all tip atoms were deformable but the topmost layer of atoms were rigid. We again 
obtained two-stage frictional behaviour (see Supplementary Discussion 13). 

In the suspended graphene system, we created an ultraflat graphene sheet 
with dimensions of 38.5 nm x 38.6 nm in the x-z plane. The boundaries were 


non-periodic. To constrain translational movement during tip sliding, the atoms 
near the boundaries were fixed rigidly. The tip was the same as that used in the sup- 
ported system. We coupled a harmonic spring with a lateral stiffness of 30.0N m7! 
and pulled the silicon tip at a velocity of 2m s~! in the x direction for friction tests. 
To minimize the boundary effects!4, a small sliding distance of 2.5 nm was used, 
compared to the much larger width of the graphene sheet (about 39 nm). Again, 
the simulations were carried out at 300K. 

Contact area. The contact area between the tip and graphene is estimated by 
setting a cutoff distance r, to judge whether graphene atoms are in intimate con- 
tact with the tip atoms. Here r,=4 A, which is slightly larger than the equilibrium 
distance of the Lennard-Jones potential imposed on the tip-graphene interaction. 
The contact area is then obtained by multiplying the total number of attached 
graphene atoms by the graphene atomic area of 2.77 A? per atom. It should be 
noted that the qualitative conclusions related to contact area are not affected even 
when we change the cutoff distance r, slightly. For example, the 20% increase of r, 
causes an increase of contact area by only 6%. 

Force analysis. To minimize the influence of thermal fluctuation on interfacial 
forces, the force analyses were performed on quasistatic configurations which were 
obtained by further relaxing the system using the conjugate gradient method. This 
relaxation does not alter the frictional behaviour qualitatively, but it makes the 
statistical results more reliable and stable. 

Kurtosis. The kurtosis is calculated as K = j1,/o*, where j14 is the fourth moment 
about the mean value and a is the standard deviation of the distribution. 
Specifically, [y= D>, (f friction (n) —f )*p(n) ando?= Ss (f friction (n) Ff p(n), 
where f friction (4) is the atomic friction force of the nth bin, p(n) is the normalized 
atom number in each corresponding bin and f is the mean value of f*'*°" (n) of 
all bins, where f = y,, ft" (7) p(n). In our case, kurtosis measures the ‘peak- 
edness and ‘tailedness’ of the force distribution. 

Data availability. The authors declare that all data supporting the findings of this 
study are available within the paper and its Supplementary Information. 
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Catalytic activation of carbon-carbon bonds in 


cyclopentanones 


Ying Xia!?, Gang Lu’, Peng Liu* & Guangbin Dong!” 


In the chemical industry, molecules of interest are based primarily on 
carbon skeletons. When synthesizing such molecules, the activation 
of carbon-carbon single bonds (C-C bonds) in simple substrates is 
strategically important: it offers a way of disconnecting such inert 
bonds, forming more active linkages (for example, between carbon 
and a transition metal) and eventually producing more versatile 
scaffolds'-'%. The challenge in achieving such activation is the kinetic 
inertness of C-C bonds and the relative weakness of newly formed 
carbon-metal bonds®!*. The most common tactic starts with a three- 
or four-membered carbon-ring system® '’, in which strain release 
provides a crucial thermodynamic driving force. However, broadly 
useful methods that are based on catalytic activation of unstrained 
C-C bonds have proven elusive, because the cleavage process is much 
less energetically favourable. Here we report a general approach to 
the catalytic activation of C-C bonds in simple cyclopentanones and 
some cyclohexanones. The key to our success is the combination 
of a rhodium pre-catalyst, an N-heterocyclic carbene ligand and an 
amino-pyridine co-catalyst. When an aryl group is present in the 
C3 position of cyclopentanone, the less strained C-C bond can be 
activated; this is followed by activation of a carbon-hydrogen bond 
in the aryl group, leading to efficient synthesis of functionalized 
c-tetralones—a common structural motif and versatile building 
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Figure 1 | Activation of C-C bonds in ring systems. a, Catalytic 

C-C activation of strained rings (for example, in cyclopropane or 
cyclobutanone). The unfavourable energetics of C-C activation can 

be compensated by the release of strain in the ring. M, transition 

metal. b, Stoichiometric rhodium-mediated C-C activation of less 
strained cycloketones. This reaction is less efficient than that shown 

in a. Ph, phenyl. c, Directed catalytic C-C activation of less strained 
cycloketimines. The yield is high for large cycloketimines, but low 

for smaller ones. Bu, butyl; coe, cyclooctene; Me, methyl; PCy3, 
tricyclohexylphosphine. d, The challenge in terms of activating the C-C 
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block in organic synthesis. Furthermore, this method can 
substantially enhance the efficiency of the enantioselective synthesis 
of some natural products of terpenoids. Density functional theory 
calculations reveal a mechanism involving an intriguing rhodium- 
bridged bicyclic intermediate. 

The most common strategy for cleaving a C-C bond uses a ring 
system, in which the unfavourable energetics of C-C cleavage can 
be compensated for by the release of strain in the ring. However, 
although there has been success in activating C-C bonds in cyclo- 
propane and cyclobutane derivatives (Fig. 1a)°"'°, the catalytic C-C 
activation of less strained or non-strained rings is underdeveloped. 
The rhodium-mediated decarbonylation of cycloalkanones has 
been described?; unfortunately, however, C-C activation for the less 
strained cyclopentanone showed low efficiency (Fig. 1b). For larger 
cyclic structures, an innovative chelation-based strategy has also been 
developed, in which ketimines prepared from the corresponding 
ketones and 2-amino-3-picoline allowed a directed C-C activation 
through a five-membered metallacycle*!°. However, although medium 
to large cyclic ketimines can be efficiently activated in this way, the 
strategy is problematic for cyclopentanone- and cyclohexanone- 
derived ketimines (Fig. 1c)'®. Our laboratory recently developed a 
catalytic C-C activation of isatins, a less strained ring system, to afford 
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the thermodynamic driving forces do not always allow oxidative 
addition with a transition metal, instead favouring the reverse process 
(C-C reductive elimination). R, hydrocarbon side chain. e, Our 
strategy for catalytic C-C activation of cyclopentanones: merging 

the unfavourable C-C activation with C-H activation, to produce an 
overall thermodynamically favoured reaction. Specifically, when using 
a cyclopentanone with an aryl group in the C3 position, the imine 
intermediate (A; with a pendant directing group, DG) should allow the 
transient C-C activation intermediate (B) to undergo an intramolecular 
C-H activation, producing rhodacycle (C). Subsequent reductive 
elimination will lead to «-tetralone derivatives. 
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2-quinolinones via decarbonylation and alkyne insertion!*®; however, 
a pre-installed pyridine directing group proved essential. Hence, to 
our knowledge, catalytic C-C activation of normal cyclopentanones 
remains an unmet challenge!”"'®. 

It has been postulated that the major difficulty comes from the 
reversibility of the C-C activation step (Fig. 1d)"®!°. In contrast to 


a 5 mol% [Rh(C,H,),Cllp 


LETTER 


three- or four-membered rings, cyclopentanones lack sufficient ther- 
modynamic driving forces to favour oxidative addition with a transition 
metal®; instead, the reverse process—C-C reductive elimination—is 
often preferred (Fig. 1d)''°. We hypothesized that such a problem could 
be solved by merging an unfavourable C-C activation with a tandem 
sp’ C-H functionalization to enable an overall thermodynamically 
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Figure 2 | Substrate scope. a, Scope of the cyclopentanones. The 
shaded box at the top shows the basic reaction: the substrates are 
3-arylcyclopentanones (1la-29a); the major products, a-tetralones 
(1b-29b), come from cleavage of the more hindered C1-C2 bond, 

while the minor products, a-indanones (1c-29c), are generated through 
cleavage of the less hindered C1-C5 bond. [Rh(C2H,4)2Cl]) is the catalyst 
precursor and 2-aminopyridine (C1) is the co-catalyst; TsOH is toluene 
sulfonic acid. Ar, aryl group. Below the shaded box are shown the major 
products and their isolated yields. b, Scope of the cyclohexanones. 
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favoured transformation’. As depicted in Fig. le, installing an aryl 
group at cyclopentanone’s C3 position should allow the transient C-C 
activation intermediate (B) generated using Jun’s temporary directing 
strategy® to undergo an intramolecular ortho C-H activation, to give 
rhodacycle (C). Subsequent sp”-sp” reductive elimination would ulti- 
mately lead to «-tetralone derivatives—well known building blocks 
used by the chemical industry and common structural motifs found in 
bioactive compounds”? *. The whole catalytic process would convert 
an aliphatic ketone to a more stable aryl ketone, accounting for the 
overall thermodynamic benefits of this transformation (AG would be 
about —6 kcal mol7!). 

To test our hypothesis, we chose 3-phenylcyclopentanone 
(1a in Fig. 2) as the model substrate. After a survey of various reaction 
parameters (see Supplementary Information and Supplementary 
Figs 1, 2), we found that using [Rh(C2H4).Cl]» as the catalyst pre- 
cursor and 2-aminopyridine (C1) as the co-catalyst in 1,4-dioxane 
indeed offered the desired C-C activation products, with an 
85% yield (Supplementary Table 1, entry 1). We observed two 
skeleton-rearranged products, in a 6.4:1 ratio. The major product, 
a-tetralone (1b), came from the cleavage of the more hindered C1-C2 
bond; the minor product, a-indanone (1c), was generated from the 
cleavage of the less hindered C1-C5 bond. Further study revealed that 
[Rh(CzH4)2Cl]2 is a superior pre-catalyst, while others are much less 
efficient (Supplementary Table 1, entries 2-4). We investigated a series 
of 2-aminopyridines, and the simple 2-aminopyridine (C1) remained 
optimal; in contrast, the secondary amines (C2 and C3) exhibited 
almost no reactivity (Supplementary Table 1, entries 5-7). In terms of 
ligands, the unsaturated N-heterocyclic carbenes IMes (1,3-bis(2,4,6- 
trimethylphenyl])imidazol-2-ylidene) and the bulkier IPr (1,3-bis(2,6- 
diisopropylphenyl)imidazol-2-ylidene) afforded comparable results 
(Supplementary Table 1, entry 8); in contrast, saturated N-heterocyclic 
carbenes (such as SIMes: 1,3-bis(2,4,6-trimethylpheny]l)-4, 
5-dihydroimidazol-2-ylidene) showed substantially diminished 
reactivity (Supplementary Table 1, entry 9). Note that phosphine 
ligands, such as PCy3 and BINAP (2,2’-bis(diphenylphosphino)-1,1’ 
-binaphthyl), failed to promote the transformation (Supplementary 
Table 1, entries 10, 11). Interestingly, lowering the loading of amine 
co-catalyst C1 enhanced the regioselectivity, such that the a-tetralone 
product was formed almost exclusively, albeit with a compromised yield 
(Supplementary Table 1, entry 12). Increasing the reaction temperature 
failed to improve the yield (Supplementary Table 1, entry 13), but 
addition of 50 mol% of water increased the yield to 86% without loss 
of regioselectivity (Supplementary Table 1, entry 14). 

We then investigated the substrate scope of this C-C activation 
reaction (Fig. 2). Under optimized reaction conditions (condition 
A: 25 mol% of the co-catalyst 2-aminopyridine; temperature 140°C), 
standard substrate 1a gave a somewhat lower yield when the reaction 
was run on a 0.2mmol or 1.0 mmol scale. Under a slightly modified 
condition (condition B: 50 mol% of 2-aminopyridine; temperature 
150°C), the conversion of 1a increased and the major product (1b) 
was isolated at a yield of 73%, although the regioselective ratio (r.r.) 
dropped from >10:1 to 5.8:1. Substrates bearing an electron-neutral or 
electron-rich substituent at the para-position of the aryl ring afforded 
comparable yields (products 2b-6b). For cyclopentanones with an 
electron-deficient aryl group, the reactions were much slower under 
condition A; however, a decent yield and excellent regioselectivity could 
be achieved under condition B (products 7b-14b). A wide range of 
functional groups—such as ester, ketone, fluoride, aryl chloride, 
cyano, sulfonyl, aryl boronate, olefin and free hydroxyl groups—are 
well tolerated in the products (7b-16b, 20b). Meta-substituted arenes 
are also competent substrates, in which C-H activation occurred site 
selectively at the less hindered position (products 17b, 18b). The more 
sterically encumbered (ortho-substituted) substrates 19a and 20a still 
exhibited high reactivity. In particular, naphthyl-substituted cyclopen- 
tanones were smoothly converted to tricyclic compounds (21b, 22b); 
a quaternary centre at the C3 position did not affect the efficiency of 
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this reaction, although the regioselectivity decreased to some extent 
(products 23b, 24b). As expected, the reaction was sluggish for cyclo- 
pentanones bearing a C2 substituent (for example, 25a); nevertheless, 
C-C activation still occurred predominantly at the more hindered site 
(to produce 25b). Moreover, the 3,4-disubstituted and ring-fused cyclo- 
pentanones are also suitable substrates, and the relative configuration is 
preserved during the reaction (products 26b, 27b). Finally, this method 
allows the preparation of unusual derivatives of complex bioactive 
molecules (28b and 29b). For example, a fused ring can be conveniently 
installed on oestrone (to produce 29b). 
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Figure 3 | Gram-scale synthesis and synthetic applications. a, Gram- 
scale reactions. Compared with the small-scale reaction, using substrate 

9a in a gram-scale reaction produced a higher percentage yield; similarly, 
when using substrate 24a, less catalyst was needed. b, Standard protocols 
can be used to add several new functional groups to our C-C activation 
product «-tetralone 9b. Bn, benzyl group. c, Applications in asymmetric 
total syntheses of terpenoids. It is known that a-tetralone (2b) is an 
intermediate for accessing erogorgiaene”””’, (R)-ar-himachalene” and 
(—)-heliophenanthrone”? in three to seven steps (shown at the bottom). 
However, enantioselective preparation of «-tetralones with a C4 stereocentre 
is non-trivial and generally requires many steps” (top left, pale grey). 

In our technique (top right, dark grey), we synthesized optically enriched 
3-arylcyclopentanones in a single step through asymmetric conjugate 
addition of cyclopentenone and arylboronic acids”; then, using our C-C 
activation approach, we isolated «-tetralone ((R)-2b) in 64% yield over two 
steps, with 94.5% chirality transfer. e.r., enantiomeric ratio. 
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Compared with cyclopentanones, cyclohexanones are generally less 
strained (see the computed ring strains in Supplementary Information), 
and thus are more challenging substrates for C-C activation!». Indeed, 
under our standard conditions (A or B), no C-C activation or rear- 
rangement product was observed when using 3-arylcyclohexanones 
(for example, 30a or 35a) as the substrate. However, when using the 
bulkier IPr as the ligand and 2-amino-6-picoline as the co-catalyst 
(Fig. 2b, condition C), the C-C activation product 30c was isolated in 
34%-46% yield (67%-73% yield on the basis of recovered starting material). 
For example, the C1-C6 bond of cyclohexanone 30a was exclusively 
cleaved to furnish the product a-indanone, giving the opposite regiose- 
lectivity from cyclopentanones. In addition, cyclohexanone substrates 
containing electron-rich or electron-deficient aryl groups show compa- 
rable reactivity (products 31c-33c). Moreover, heterocyclic substrates, 
such as 34a, also underwent the C-C activation/rearrangement to 
produce alkoxy-substituted a-indanone (34c). The quaternary centre 
in the cyclohexanones appears to facilitate the C-C/C-H activation 
process, as 3-phenylcyclohexanone (35a) only afforded a trace amount 
of the product (35c). The 6,6-fused cyclohexanone 36a can also 
undergo this transformation to afford a spirocyclic a-indanone (36c). 

To demonstrate the practicability of this transformation, we carried 
out gram-scale reactions (Fig. 3a), which offered either a higher yield 
compared with the small-scale reaction (when using substrate 9a; see 
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TS4 (disfavoured) 
AG? = 37.5 kcal mol 


C-C bond activation. AG*, Gibbs free energy of activation. c, Transition 
states during C-H bond cleavage. Energies are computed at the M06/ 
SDD-6-311+G(d,p)/SMD (1,4-dioxane) level of theory, with geometries 
optimized at the B3LYP/LANL2DZ-6-31G(d) level (see Supplementary 
Information for more details and references). 


above and Fig. 2) or a lower catalyst loading (when using substrate 24a). 
To show the versatile nature of a-tetralones in organic synthesis, we 
introduced several new functional moieties to C-C activation product 
9b through standard protocols (Fig. 3b). The synthetic potential of this 
method can be further demonstrated in the asymmetric syntheses of 
natural products (Fig. 3c). For example, «-tetralone (2b) is a known 
intermediate for accessing erogorgiaene”™”!, (R)-ar-himachalene” and 
(—)-heliophenanthrone” in three to seven steps; however, enantiose- 
lective preparation of a-tetralones with a C4 stereocentre is non-trivial 
and generally requires many steps**. Given that optically enriched 
3-arylcyclopentanones can be synthesized in a single step from asym- 
metric conjugate addition between cyclopentenone and arylboronic 
acids*4, using the C-C activation approach we isolated a-tetralone 
((R)-2b) in 64% yield over two steps, with 94.5% chirality transfer. 
The slight decrease in enantioselectivity was probably caused 
by a reversible 8-H elimination process with the C-C activation 
intermediate (see rhodacycle B above; Fig. le). With an efficient route 
to (R)-2b, the synthesis of the aforementioned terpenoids is markedly 
streamlined. 

To better understand the origin of the regioselectivity and the 
mechanism of the C-H metallation step, we carried out density func- 
tional theory (DFT) calculations. The computed free energy profile 
of the key steps in the catalytic cycle is shown in Fig. 4a. Through the 
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installation of a temporary directing group, the resulting ketimines 
(both E and Z isomers) can initially bind to the rhodium in a biden- 
tate fashion to form 41a and 41b. The pyridyl group remains bound 
to the metal and promotes C-C oxidative addition (via transition 
states TS1 and TS2, Fig. 4b). While cleavage of the sterically more 
hindered C1-C2 bond in TS1 is slightly less favoured, activations 
of the Cl-C2 and C1-C5 bonds are both reversible, with a rela- 
tively low activation barrier. Indeed, the regioselectivity is deter- 
mined in the subsequent C-H metallation step (via TS3 and TS4) 
and C-C reductive elimination step (via TS5 and TS6)—both 
steps requiring much higher activation energies than that of the 
C-C cleavage. The pathway with rhodacycle 42a (derived from activa- 
tion of the C1-C2 bond) has lower barriers in both C-H metallation 
(TS3) and C-C reductive elimination (TS5) than the corresponding 
reaction with rhodacycle 42b (derived from activation of the 
C1-C5 bond). 

The substantial destabilization of the disfavoured transition states 
(TS4 and TS6) is caused mainly by the steric repulsions between the 
C5 methylene group in the forming six-membered rhodacycle and the 
ortho methyl group on the IMes ligand. This unfavourable interac- 
tion in TS4 is evidenced by the short hydrogen—-hydrogen distance 
of 2.08A (Fig. 4c). In contrast, these steric repulsions are diminished 
in TS3 and TSS, in which the methylene group is positioned further 
away from the [Mes ligand owing to the shorter tether in the forming 
five-membered metallacycle. The strong kinetic preference for the 
pathway involving the 5,6-bridged rhodacycle 43a explains the high 
level of regioselectivity for the C1-C2-activation product. The sub- 
sequent protonation (Supplementary Fig. 12) requires lower barriers 
and provides thermodynamic driving forces to finally form a-tetralone 
(1b) upon hydrolysis. Regarding the C-H metallation step, we found 
that a chloride-mediated metallation-deprotonation pathway”* was 
more favourable; in contrast, concerted 1,4-rhodium migration?**° 
through o-bond metathesis can be ruled out owing to the much higher 
activation energy required. 

This computationally proposed mechanism is also consistent with 
observed kinetic isotope effects and with isotope labelling experi- 
ments (see Supplementary Information). First, the small primary 
kinetic isotope effect suggests that the C-H metallation and C-C 
reductive elimination are both turnover limiting. Second, the incom- 
plete deuterium transfer seen in a reaction with deuterated substrate 
provides further evidence against the concerted 1,4-rhodium migration 
mechanism. 


Received 27 December 2015; accepted 6 September 2016. 
Published online 2 November 2016. 


1. Gozin, M., Weisman, A., Ben-David, Y. & Milstein, D. Activation of a carbon- 
carbon bond in solution by transition-metal-insertion. Nature 364, 699-701 


(1993). 
2. Jones, W. D. The fall of the C-C bond. Nature 364, 676-677 (1993). 
3. urakami, M., Amii, H. & Ito, Y. Selective activation of carbon—carbon bonds 


next to a carbonyl group. Nature 370, 540-541 (1994). 

4. Rybtchinski, B. & Milstein, D. Metal insertion into C-C bonds in solution. 

Angew. Chem. Int. Ed. 38, 870-883 (1999). 

5. urakami, M. & Ito, Y. Cleavage of carbon—carbon single bonds by transition 

metals. Top. Organomet. Chem. 3, 97-129 (1999). 

6. Jun, C.-H. Transition metal-catalyzed carbon-carbon bond activation. Chem. 

Soc. Rev. 33, 610-618 (2004). 

Ts iura, M. & Satoh, T. Catalytic processes involving 8-carbon elimination. 

Top. Organomet. Chem. 14, 1-20 (2005). 

8. Park, Y. J., Park, J.-W. & Jun, C.-H. Metal-organic cooperative catalysis in C-H 
and C-C bond activation and its concurrent recovery. Acc. Chem. Res. 41, 
222-234 (2008). 

9. Dong, G. (ed.) C-C bond activation Vol. 346 of Topics in Current Chemistry 
(Springer, 2014) 

10. Souillart, L. & Cramer, N. Catalytic C-C bond activations via oxidative addition 
to transition metals. Chem. Rev. 115, 9410-9464 (2015). 

11. Seiser, T., Saget, T., Tran, D. N. & Cramer, N. Cyclobutanes in catalysis. Angew. 
Chem. Int. Ed. 50, 7740-7752 (2011). 


550 | NATURE | VOL 539 | 24 NOVEMBER 2016 


12. Mack, D. J. & Njardarson, J. T. Recent advances in the metal-catalyzed ring 
expansions of three- and four-membered rings. ACS Catal. 3, 272-286 (2013). 

13. Masarwa, A. et al. Merging allylic carbon-hydrogen and selective carbon- 
carbon bond activation. Nature 505, 199-203 (2014). 

14. Halpern, J. Determination and significance of transition-metal-alkyl bond 
dissociation energies. Acc. Chem. Res. 15, 238-244 (1982). 

15. Jun, C.-H., Lee, H. & Lim, S.-G. The C-C bond activation and skeletal 
rearrangement of cycloalkanone imine by Rh(I) catalysts. J. Am. Chem. Soc. 
123, 751-752 (2001). 

16. Zeng, R. & Dong, G. Rh-catalyzed decarbonylative coupling with alkynes via 
C-C activation of isatins. J. Am. Chem. Soc. 137, 1408-1411 (2015). 

17. Mukai, C., Ohta, Y., Oura, Y., Kawaguchi, Y. & Inagaki, F. Csp?-Csp? and Csp?-H 
bond activation of 1,1-disubstituted cyclopentane. J. Am. Chem. Soc. 134, 
19580-19583 (2012). 

18. Li, G., Arisawa, M. & Yamaguchi, M. Rhodium-catalyzed synthesis of 
unsymmetrical di(aryl/heteroaryl)methanes using aryl/heteroarylmethy| 
ketones via CO-C bond cleavage. Chem. Commun. 50, 4328-4330 (2014). 

19. Albrecht, M., Gossage, R. A., Spek, A. L. & van Koten, G. Metal-mediated C-C 
bond making and breaking: first direct evidence for a reversible migration 
of a benzyl group along a metal-carbon bond. J. Am. Chem. Soc. 121, 
11898-11899 (1999). 

20. Davies, H. M. L. & Walji, A. M. Direct synthesis of (+)-erogorgiaene through a 
kinetic enantiodifferentiating step. Angew. Chem. Int. Ed. 44, 1733-1735 
(2005). 

21. Elford, T. G., Nave, S., Sonawane, R. P. & Aggarwal, V. K. Total synthesis of 
(+)-erogorgiaene using lithiationborylation methodology, and stereoselective 
synthesis of each of its diastereoisomers. J. Am. Chem. Soc. 133, 
16798-16801 (2011). 

22. Chavan, S. P. & Khatod, H. S. Enantioselective synthesis of the essential oil 
and pheromonal component ar-himachalene by a chiral pool and chirality 
induction approach. Tetrahedron Asymmetry 23, 1410-1415 (2012). 

23. Mukherjee, P. & Sarkar, T. K. Heteroatom-directed Wacker oxidations. 

A protection-free synthesis of (—)-heliophenanthrone. Org. Biomol. Chem. 10, 
3060-3065 (2012). 

24. Hayashi, T. & Yamasaki, K. Rhodium-catalyzed asymmetric 1,4-addition 
and its related asymmetric reactions. Chem. Rev. 103, 2829-2844 (2003). 

25. Lapointe, D. & Fagnou, K. Overview of the mechanistic work on the concerted 
metallation deprotonation pathway. Chem. Lett. 39, 1118-1126 (2010). 

26. Matsuda, T., Shigeno, M. & Murakami, M. Asymmetric synthesis of 
3,4-dihydrocoumarins by rhodium-catalyzed reaction of 3-(2-hydroxyphenyl) 
cyclobutenones. J. Am. Chem. Soc. 129, 12086-12087 (2007). 

27. Seiser, T., Roth, O. A. & Cramer, N. Enantioselective synthesis of indanols from 
tert-cyclobutanols using a rhodium-catalyzed C-C/C-H activation sequence. 
Angew. Chem. Int. Ed. 48, 6320-6323 (2009). 

28. Shigeno, M., Yamamoto, T. & Murakami, M. Stereoselective restructuring of 
3-arylcyclobutanols into 1-indanols by sequential breaking and formation of 
carbon-carbon bonds. Chem. Eur. J. 15, 12929-12931 (2009). 

29. Matsuda, T., Suda, Y. & Takahashi, A. Double 1,4-rhodium migration 
cascade in rhodium-catalysed arylative ring-opening/spirocyclisation 
of (3-arylcyclobutylidene)acetates. Chem. Commun. 48, 2988-2990 (2012). 

30. Matsuda, T. & Yuihara, |. A rhodium(|)-catalysed formal intramolecular 
C-C/C-H bond metathesis. Chem. Commun. 51, 7393-7396 (2015). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements This project was supported by the Cancer Prevention 
Research Institute of Texas (grant R1118), the National Institute of General 
Medical Science (grant RO1GM109054) and the Welch Foundation (grant 
F1781). G.D. is a Searle Scholar and Sloan fellow. Y.X. acknowledges the 
International Postdoctoral Exchange Fellowship Program 2015 from the 
Office of China Postdoctoral Council (OCPC, document 38, 2015). We thank 
Johnson Matthey for a donation of Rh salts, and Chiral Technologies for 
donation of chiral high-performance liquid-chromatography columns. 

We are grateful to Y. Xu for providing 1,4-dioxane, F. Mo for providing some 
3-aryl cyclopentanones, and H. Lim for checking the experimental procedures. 
DFT calculations were performed using supercomputer resources at the Center 
for Simulation and Modeling at the University of Pittsburgh, and the Extreme 
Science and Engineering Discovery Environment supported by the National 
Science Foundation. 


Author Contributions Y.X. and G.D. conceived and designed the experiments. 
Y.X. performed the experiments. G.L. performed the DFT calculations. Y.X., G.L., 
P.L. and G.D. co-wrote the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to 

PL. (pengliu@pitt.edu) and G.D. (gbdong@uchicago.edu). 


Reviewer Information Nature thanks J. Harvey, M. Lautens and the other 
anonymous reviewer(s) for their contribution to the peer review of this work. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature19842 


Magnetic reversals from planetary dynamo waves 


Andrey Sheyko!, Christopher C. Finlay? & Andrew Jackson! 


A striking feature of many natural dynamos is their ability to 
undergo polarity reversals”. The best documented example is 
Earth’s magnetic field, which has reversed hundreds of times during 
its history**. The origin of geomagnetic polarity reversals lies ina 
magnetohydrodynamic process that takes place in Earth’s core, 
but the precise mechanism is debated*. The majority of numerical 
geodynamo simulations that exhibit reversals operate in a regime in 
which the viscosity of the fluid remains important, and in which the 
dynamo mechanism primarily involves stretching and twisting of field 
lines by columnar convection®. Here we present an example of another 
class of reversing-geodynamo model, which operates in a regime of 
comparatively low viscosity and high magnetic diffusivity. This class 
does not fit into the paradigm of reversal regimes that are dictated by 
the value of the local Rossby number (the ratio of advection to Coriolis 
force)”. Instead, stretching of the magnetic field by a strong shear 
in the east-west flow near the imaginary cylinder just touching the 
inner core and parallel to the axis of rotation is crucial to the reversal 
mechanism in our models, which involves a process akin to kinematic 
dynamo waves”””. Because our results are relevant in a regime of 
low viscosity and high magnetic diffusivity, and with geophysically 
appropriate boundary conditions, this form of dynamo wave may also 
be involved in geomagnetic reversals. 

Motion of liquid metal in Earth’s core is the source of the geomag- 
netic field and its time variations, including dramatic polarity rever- 
sals wherein the large-scale field switches to point in the opposite 
direction. The past twenty years have seen considerable advances in 
our ability to model this process, with the advent of 3D numerical 
geodynamo models capable of solving the equations of conservation 
of momentum, magnetic induction and buoyancy transport in a self- 
consistent manner!!!?, Some such models show polarity reversals!3-}5, 
which proceed by the production and expulsion of inverse magnetic 
flux by intermittent convective plumes, and subsequent transport by 
meridional flow®’®. By necessity, such simulations operate in a regime 
in which viscous effects remain important. However, the values of the 
two dimensionless parameters that characterize these simulations— 
the Ekman number Ek= v/(2.2d’) and the magnetic Prandtl number 
Pr =v/n, where vis the kinematic viscosity, (2 is the angular rotation 
rate, d is the thickness of the spherical shell of convecting fluid and 
nis the magnetic diffusivity—differ greatly from the values estimated 
for Earth’s core, and so it is possible that other processes operate under 
such conditions’’. Figure 1 summarizes Ek and Pr,, for a selection of 
published models that exhibit polarity reversals (see Extended Data 
Table 1). 

Inan effort to push towards the low-viscosity regime, some researchers 
have experimented with stress-free boundary conditions in geodynamo 
models, and discovered a wide variety of quadrupolar and hemispheric 
dynamos, some of which undergo prominent field oscillations, excur- 
sions and reversals'™!* via a dynamo-wave mechanism’. Dynamo waves 
have been extensively studied in both 2D and 3D using prescribed 
flows”'?°; they involve cyclic variations in magnetic induction due 
to the combined action of differentially rotating (shear) flow and 
helical flow. Up to now, the relevance of such solutions to Earth has 
been uncertain because they have been found only in kinematic models, 
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Figure 1 | Control parameters of a selection of numerical geodynamo 
models that exhibit polarity reversals. The blue dots are the parameters 
for published reversing-geodynamo models (see Extended Data Table 1). 
The red dot shows the parameters for the reversing dynamo S6 discussed 
here. Ek =v/(2.Qd’) is the Ekman number and Pry, = v/7 is the magnetic 
Prandtl number, where v is the kinematic viscosity, {2 is the angular rotation 
rate, d is the thickness of the spherical shell of convecting fluid and 7 is the 
magnetic diffusivity. For case $6, Ek= 1.2 x 10° and Pr, =0.05. The Earth 
lies off to the bottom left in this diagram (as indicated by the arrow), with 
estimated values of Ek= O(10~) and Pr, = O(10~°). 


or in dynamic models with unusual boundary conditions or forcing 
modes. Here, we demonstrate that dynamo waves can also produce 
polarity reversals in a standard geodynamo model set-up, provided 
that Ek and Pr», are sufficiently small and that the rate of convective 
driving is sufficiently large. 

We adopt a numerical approach to solve the equations governing 
convection-driven magnetohydrodynamics in a rapidly rotating, 
electrically conducting, spherical shell (see Methods). The calcula- 
tions reported here were started from an earlier run with parameters 
and set-up as close as possible to a previously reported dynamo 
simulation”! ”, which uses among the smallest published values for 
Ek (1.2 x 107°) and Pr, (0.2). In a calculation hereafter referred to as 
‘S6, the convective driving (as measured by the Rayleigh number Ra; 
see Equation (2) in Methods) was then increased by a factor of 30 and 
Prm decreased to 0.05, in an effort to break the strong equatorial anti- 
symmetry of the generated field and to enhance its time variability. 
Figure 2a shows that, following the change in the control parameters, 
there is a gradual decrease in the magnitude of the magnetic energy 
in dynamo S6. Then, after about 1.1 magnetic diffusion times with a 
steady, dipole-dominated field, nearly periodic reversals of the dipole 
field set in (Fig. 2b). The period of the reversals is approximately 0.18 
magnetic diffusion times and they persist for the 6 magnetic diffusion 
times that we have been able to calculate so far. We repeated the 
calculations with increased model resolution, in both angular and 
radial directions, and continue to find the same reversing behaviour. 
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Figure 2 | Magnetic field time-dependence for dynamo S6. a, Time 
dependence of the magnetic energy, integrated over the volume of the 
dynamo region, for dynamo S6. The simulation was started from a 
previous dipole-dominated run similar to the dynamo of ref. 21, then 

the Rayleigh number Ra was increased by a factor of 30 and the magnetic 
Prandtl number Pr,, decreased by a factor of 4. b, The resultant time 
dependence of the dipole tilt, determined from the first three Gauss 
coefficients {e°, 8p hi} of the magnetic field at the outer boundary. Polarity 
reversals set in around 1.1 magnetic diffusion times. 


Figure 3 presents the detailed structure and time-dependence of the 
reversing field in dynamo S6. The radial magnetic field B, at the outer 
boundary of the dynamo is primarily axisymmetric, and consists of 
bands with alternating polarity that appear at low latitude and 
subsequently move polewards (Fig. 3b, cand Supplementary Video 1). 
The northern hemisphere field is stronger and migrates polewards 
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Figure 3 | Magnetic field polarity reversal process. a, A close up of time 
dependence of the dipole tilt angle from dynamo S6 (Fig. 2b); orange dots 
mark the time instances shown in c and d. b, A butterfly plot of the 
longitudinally averaged radial magnetic field B, at the outer boundary 

of the dynamo region as a function of z=sin(A), where 4 is the latitude, 
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more slowly (see the lines in Fig. 3b, which illustrate that the speed 
dz/dt, where z=sin(A) and ) is the latitude, is 50% faster in the 
southern hemisphere). The reversing field at the outer boundary is 
linked to deep structures that are evident in meridional sections of the 
longitudinally averaged azimuthal field B , (Fig. 3d and Supplementary 
Videos 2, 3). In the equatorial plane, the sign of B, alternates, and the 
field subsequently migrates polewards in each hemisphere, moving 
parallel to the rotation axis. 

In Fig. 4, we examine the underlying flow that generates this dis- 
tinctive pattern of field reversals. Figure 4a shows the time-averaged 
kinetic energy spectrum as a function of spherical harmonic degree. 
The kinetic energy in the reversing regime is much larger than the 
magnetic energy, and it decays more slowly with spherical harmonic 
degree. It is nonetheless dominated by degrees below 20, and has a 
notably large zonal component. Snapshots and animations of the 
flow (Supplementary Videos 4, 5) reveal columnar flow patterns 
with strong alignment parallel to the rotation axis, typical of low-Ek 
rotating convection. The time and longitudinally averaged flow in 
S6 is presented in Fig. 4b, c, which shows meridional sections of the 
angular velocity (differential rotation) and radial flow. There is a strong 
retrograde jet close to the tangent cylinder; this produces substantial 
shear perpendicular to the rotation axis and is associated with weaker, 
but persistent, radial flow. There is a surprising difference between the 
angular velocities in the northern and southern hemispheres: prograde 
zonal flow at low latitudes in the southern hemisphere is absent in 
the northern hemisphere. This may be linked to the southern hemi- 
sphere being colder at the outer boundary (see Supplementary Video 6). 
Flow asymmetry between the northern and southern hemispheres, 
and a substantial quadrupolar component in the generated field, are 
conditions known to favour reversing behaviour in dynamos*>*. 
Neither the kinetic energy nor the structure of the flows varies much 
during the reversals. 

To further investigate the nature of the reversal process, we carried 
out a series of additional calculations, in which we varied the control 
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Figure 4 | Time-averaged flow in dynamo S6. a, Volume-averaged spectra 
of the kinetic energy (blue) compared to the magnetic energy (red), time- 
averaged over the reversing interval. b, A meridional section showing the 
time and longitudinally averaged angular velocity w, with blues denoting 
retrograde velocities and reds denoting prograde velocities. c, Similar 
meridional section for the radial flow (u,, radial velocity); reds show 
radially outward flow. 


parameters and set-up of the dynamo (see Extended Data Table 2). 
Upon increasing Ra, which enhances the magnetic Reynolds number 
Re,, and the amplitude of the zonal flow, we continue to find similar 
reversals, but with shorter periods and increasing time variability 
(runs $6.04-S6.08). Decreasing Pry, much further is difficult without 
losing the dynamo, but small decreases slightly shorten the reversal 
period (runs $7.02, $7.03). Replacing the electrically conducting 
inner core with an insulating inner core makes little difference (the 
reversal period becomes very slightly shorter; see run S6_InsIC), 
presumably because the reversals result primarily from processes 
outside the tangent cylinder. We also carried out one calculation with 
the Lorentz force switched off (case S6_LorOff). Very similar reversals 
were then obtained, but with a shorter period of only 0.07 magnetic 
diffusion times. The helicity of the flow did not change much upon 
switching off the Lorentz force, but the energy in the zonal part of the 
toroidal flow increased by a factor of five. A calculation using a revised 
model ‘S6-9; in which the internal heating is set to zero (and all other 
parameters remain the same as in S6; see Extended Data Table 2), has 
established that in this case the dynamo continues to reverse in a similar 
manner, with its period differing by less than 5% from that of S6 
(see Methods). 

We next consider whether these reversals can be the result of a 
dynamo-wave process. A first clue is that Parker’s theory” predicts 
poleward propagation of dynamo waves along cylinders parallel to the 
rotation axis, in the case that the magnitude of the (westward) angular 
velocity decreases with distance away from the tangent cylinder and 
helicity is negative in the northern hemisphere. This is in agreement 
with our results. In addition, the period of dynamo waves can be 
estimated by'®!8 
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Tparker — 2n-yLH(Erex)!/2) 1? (1) 


where Tparker is the predicted reversal period, y is a constant, 
H= fm —i)-V x (u—a)dV| is the modulus of the volume- 


integrated kinetic helicity of the non-axisymmetric flow that we 
evaluate in the northern hemisphere, and Eo, is the zonal toroidal flow 
energy. Both quantities were time-averaged over the reversing interval. 
Upon comparing the predictions of this theory with the reversal 
periods found in our numerical calculations (Extended Data Table 2) 
we find a reasonable agreement (Extended Data Fig. 1), especially 
regarding the decrease in reversal period as the energy of toroidal zonal 
flow increases. 

Dynamo S6 thus demonstrates that field oscillations and reversals 
can result from dynamo waves produced by rapidly rotating convection 
ina spherical shell, even in the presence of no-slip boundary conditions, 
provided that Ek and Pr, are sufficiently small. Dynamo waves are 
apparently a very robust phenomenon, relevant to dynamos across a 
range of planetary”*” and astrophysical”® scenarios. A common aspect 
of such dynamos is that they possess a relatively large toroidal magnetic 
field, obtained by stretching due to differential rotation. Strong 
azimuthal fields just outside the tangent cylinder, which are related to 
an omega effect taking place at this location, are a characteristic feature 
of the reversing dynamos reported here. 

Our more Earth-like parameter regime brings further surprises. 
The conventional wisdom is that the local Rossby number Ro, (see 
Methods) is important for delineating two regimes of behaviour’: for 
Ro; < 0.1 the dynamos are generally dipolar and of stable polarity, 
whereas, as the driving is increased and inertial effects play a more 
important part, the dynamos become multipolar and reversing”?-*. 
In these previous studies, the dynamos have viscous and Ohmic 
diffusivities that are broadly similar (Pr, = O(1)). S6, with its small 
Pr, and Ek, has Ro; = 0.06 (Extended Data Table 2) and so contradicts 
the accepted regime division by lying in the purportedly stable dipolar 
regime. Similarly, our dynamo S6 should lie close to the non-reversing 
regime (Ro; < 0.05) according to a study of magnetic reversal frequency 
scaling’. Conversely, it exhibits a reversal frequency that is predicted to 
occur only at Ro|= 0.4 according to dynamos for which Ek >3 x 10~4 
and 3 < Pry», <20. 

The polarity reversals exhibited by S6 are nearly periodic, but if 
the Rayleigh number is increased, for example, as in $6.07, the time 
dependence becomes more variable, and Earth-like events such as 
dipole excursions become possible (Extended Data Fig. 2). Moreover, 
at a radius corresponding to Earth’s surface, the field in S6 is dipole- 
dominated between reversals (Supplementary Video 7). Our results 
therefore indicate that dynamo waves could indeed play a part in long- 
term geomagnetic variability”**. Unfortunately, it is currently a huge 
computational undertaking to produce the long time series needed 
to fully characterize the statistics of reversals in the low-Ek regime: 
S6 required 4 x 10° CPU hours (approximately 457 CPU years) to 
integrate for 6.5 magnetic diffusion times. More complete studies of 
the dynamics of polarity reversals in dynamos at low viscosity may 
therefore have to wait for new numerical tools that are capable of more 
efficiently exploring this challenging regime. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Governing equations and non-dimensionalization. We adopt the Boussinesq 
approximation for convection-driven, rotating magnetohydrodynamics, which 
results in the following non-dimensional equations: 


[Ros — BV? |= — ve 
Ot 
0 2 
“ _v?|B=V x (ux B) 
ot 


fe 
at 


N, = Rou x (V x u) +(V x B) x B+ qRaTr—Z xu 


-4¥'}t=e—u. VT 


where 


The variables u, B and T are the velocity, magnetic field and temperature, 
respectively. VP is the modified pressure that contains information about 
conservative forces. The axis of rotation of the system is z and Z is a unit vector in 
that direction. Time is denoted as ¢. A uniform heat source ¢ is included. Solenoidal 
conditions V-B=0 and V-u=0 are integrated into the solution technique through 
use of a poloidal-toroidal decomposition of the vector field. The non-dimensional 
parameters are 


Ro=7/ (22d?) 

Ek = v/(2d’) 

Ra= ga ATd/(22k) 
q=r/n 


(2) 


where Ro is the magnetic Rossby number, Ek is the Ekman number, Ra is the 
modified Rayleigh number and q is the Roberts number. 

The units of length, time, magnetic field and temperature for the non- 
dimensional governing equations are chosen as 


t—(d?/n)t, 


The following symbols denote the parameters of the system: 2 = (22 is the rotation 
rate, /to is the permeability of free space, fo is the density, v, « and 7 are the 
kinematic viscosity, thermal diffusivity and magnetic diffusivity, respectively, a is 
the thermal expansivity and d is the thickness of the spherical shell of convecting 
fluid. The unit of temperature AT is chosen to be Gd/ 3", where 3 and 3° = 
—2(1 — r/ro) are the dimensional and non-dimensional temperature gradients at 
the core-mantle boundary (CMB), respectively, and r;,. are the inner and outer 
radii of the shell, respectively. Gravity is assumed to vary linearly with radius and 
has value g on the outer boundary. The spherical coordinates are denoted (r, 6, y). 
Boundary conditions and internal heating. The modelled fluid is enclosed in a 
rotating spherical shell between radii r, and r, with c=1;/r. = 0.35. Both boundaries 
are no-slip and impermeable. The outer boundary is electrically insulating, the 
inner core has the same electrical conductivity as the outer core. In case S6_InsIC 


the inner core is insulating. The inner-core temperature is kept constant at 
T= ce+e+2 

2(1—c)e 
(1 — c). A uniform heat source with ¢ = 3q is adopted throughout the outer core. 


Bo (22pottgn)'/?B, ToATT, d=n-mnri 


r— dr, 


; the gradient of the temperature on the outer boundary is 3° =—2/ 


In run S6-o, the uniform heat source ¢ is set to zero. 


Diagnostics. The dipole latitude is calculated as arcsin kt ie ( gy + ( gi) + (ni) : 


where {gi 8» hi} are the first three Gauss coefficients of the magnetic field at the 


outer boundary. The kinetic and magnetic energies are defined as Ekin = ~ if wdV 
and Emag = =x f B?dV, where the volume integral is over the entire outer core. 


uROREm; the magnetic Reynolds 


The local Rossby number® is defined as Ro;=2 
Yujl 
2Ekin 
energy per harmonic degree |; U = ./2E,in/V; and V is the volume of the spherical 
shell. 

Numerical set-up. We solve the governing equations using a parameterization in 
spherical harmonics up to degree and order 255 for the angular component and 
528 finite-difference points in radius. A second-order predictor—corrector scheme 
is used for the time integration*‘. The time-step is adaptive and varies throughout 
the run. In the case S6, the time-step started from 10-7 magnetic diffusion times 
at the initial stage of the run with a very turbulent flow, and later increased up to 
2.5 x 10° magnetic diffusion times during the reversing interval. Parallelization 
is carried out in radius. In the linear parts of the code, data are split over 
the spherical harmonics. 528 cores were used simultaneously for one simulation. 


number Re,, is Ud/1; 1, = is the mean harmonic degree; u; /2 is the kinetic 


> 
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The bulk of the simulations and visualizations were performed on the super- 
computer Piz Daint (Cray XC 30) at the Swiss National Supercomputing Centre. 
The code was originally developed by Willis*° and then subsequently optimized 
for the Cray XC 30 and successfully benchmarked against other dynamo codes**. 
Importance of the heating mode. We carried out an additional simulation S6-o 
to test how the particular choice of the internal homogeneous heating influences 
the reversing behaviour. In the two runs S6 and S6-o, the internal heating ¢ is set 
to 3q and 0, respectively. Other parameters are the same. The run S6-9 was started 
from S6. As a consequence of changing the heating mode, the temperature drop 
between the inner core boundary (ICB) and the CMB is modified (see Extended 
Data Fig. 3). An important transient effect is the secular cooling that can be defined 
as the decrease in the thermal energy in the shell per unit time: Q,.. = — otdv 
(to evaluate Qe, the mean radial temperature gradient on the ICB is theasaned: 
see Equation (5) and Extended Data Fig. 4). We plot the ratio Q,.../ Q of the secular 
cooling to the heat coming from the ICB in the steady (Qsec= 0) state in Extended 
Data Fig. 5. In the modified run $6.9, Qcec constitutes less than 5% of the heat flux 
Q from the ICB during the final magnetic diffusion time. So, we conclude that the 
contribution of the secular cooling term to the heat flux equation is small at this 
stage. 

The reversal period in 6-9 increased by less than 5% in comparison with that in 
S6 (see dipole latitude time dependencies for both runs in Extended Data Fig. 6). 
This test suggests that the inclusion or not of the internal heating term is not crucial 
to the dynamo-wave reversals exhibited by our dynamos. Changes in the internal 
heating term do nevertheless slightly alter the value of the reversal period obtained. 
Volume-integrated terms of the heat equation. The heat flux equation is 


[2 —4q¥?|r=e u-VT 
ot 
Ina quasi-stationary state if oTdV=0, if no-slip boundary conditions (u,,= ug=0) 
Vv 
are applied, then 
fuvrav= f{ ur-ds=o 
Vv av 


and the integrated heat flux equation is 
OT 
‘Sa 

% 


This is equivalent to Q; + Qint = Qo, whereby the sum of the heat supplied by 
internal heating Qin and that from the inner core Q; is equal to the heat leaving 
through the outer boundary Qo. 
The temperature gradient at the CMB was held fixed at 3° — =| =—2/(1—0), 
% 


ds 


ds+ev=—q f 
un r r 


with c=rj/r)= 0.35. In S6 the internal heating Qin =3qV. The heat leaving through 
the outer boundary Q, = — 4qnr23”. The temperature gradient on the ICB in run 
S6 is 


aT une & 
“| =—(Q,- Q,)/(4qnr?) = Z + 5 i= 13.1 (3) 
Or |,, c c 
In the case $6.9 without internal heating, the temperature gradient at rj is 
oT : 
OF) = Q,fagnr?) =% = —25.1 (4) 
Or |,, c 


Without internal heating, in a steady case the heat from the ICB goes through the 
shell and leaves without losses or gains at the CMB. The power flowing through the 
CMB and the ICB is equal, although the heat flux is smaller at the CMB because it 
is inversely proportional to the surface area, yielding the coefficient (ro/r;)? = 1/c? 
in equation (4). 

Let us consider the non-steady case: 


aT 
Q+Qn=Q + f Sav 
Vv 


sec 


whereQ - q. ordv is the secular cooling. If the heat flux from the inner core 
Vv 

: + 0 0 F 

in the steady case is Q;, the Q,,. = Q; — Q,. So, the difference between the steady 

state and the actual value of the heat flux from the inner boundary is the secular 

cooling. The relative influence of the secular cooling can be calculated as 


Qeee/V=1 — Q,/Q (5) 
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Extended Data Figure 1 | Systematics of reversal period versus dynamo- _ from volume-averaged helicity and zonal toroidal kinetic energy time- 
wave predictions. Comparison of the scaling of periods predicted by averaged over the last three reversals in the simulation or, in shorter 
Parker’s dynamo-wave theory (Equation (1)) and the periods obtained simulations, during the second half of the reversing interval, with y= 200 
in the dynamo calculations. Periods in the numerical calculations are (Equation (1)). Numerical values of measured and estimated periods are 
determined from spectral analysis of the time series of dipole latitudes. provided in Extended Data Table 2. 


The predicted periods from Parker’s dynamo-wave theory are calculated 
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Extended Data Figure 2 | Time dependence of dipole tilt for dynamo $6.07. The time dependence of the dipole tilt, determined from the first three 
Gauss coefficients {g°, 8p hy} of the magnetic field at the outer boundary. Units are magnetic diffusion times. $6.07 was started from a previous run $6.06, 


and then the Rayleigh number Ra was increased by a factor of two. 
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Extended Data Figure 3 | The temperature drop between the ICB and the CMB in runs S6 and S6-9. Run S6-o starts from S6; the temperature drop 
increases after the internal heating is switched off. 
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Extended Data Figure 4 | Radial temperature gradient on the ICB in runs S6 and S6<o. Steady-state values are —13.1 and —25.1 for runs S6 and S6-o, 
respectively (see Equations (3) and (4)). 
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Extended Data Figure 5 | Secular cooling normalized by the steady-state heat flux through the CMB (see Equation (5)), for runs S6 and S6-». 
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Extended Data Figure 6 | Time dependence of dipole latitude for dynamos S6 and S6-9. The plot for S6 is shifted along the time axis to overlap $6.0. 
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Extended Data Table 1 | Selection of previous geodynamo model reversal studies included in Fig. 1 


Ek Pry, = Ek/Ro Author 
O(10-15) O(10-5) Earth 
10-3 — 10-4 0.67-6 Kutzner and Christensen (2002) °° 
10-2 —3 x 10-4 20 Rotvig (2009) °” 
5 x 10-3 20 Driscoll and Olson (2009) %8 
6.5 x 1073 20 Olson et al. (2009) ® 
10-5 0.5 Takahashi et al. (2007) °° 
10-2 20 Wicht and Olson (2004) '§ 
6 x 1073 20 Olson and Deguen (2012) *° 
2x107-2-—3x10-4 3410 Wicht et al. (2009) +1 
10-5 0.5 Busse and Simitev (2008) '® 
2x10-?-3x10-4 3+10 Aubert et al. (2008) 
1.1 x 1072 10 Kida et al. (1997) ‘9 
3x 1074 5 Sreenivasan et al. (2014) * 
1.1834 x 107§ 0.04—0.05 Present study 


We list two of the important control parameters, the magnetic Prandtl number Prm and the Ekman number Ek. Our aim was to make Prim substantially less than unity and 
to decrease the smallest previously reported value for Ek by at least an order of magnitude. Values from previous studies are from refs 6, 13, 16, 18, 29, 37-43. 
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Extended Data Table 2 | Runs exploring variation of reversals with control parameters and set-up 


Ek Ro Ra q Rem Ro Exin/Emag  Etor AH Tose TParker 
S6 1.1834e-06 2.3669e-05 6591 0.050 89 0.064 9.22 5.08e+03 6.51e+05 0.184 0.185 
$6.04 1.1834e-06 2.3669e-05 7250 0.050 94 0.071 8.93 5.65e+03 6.98e+05 0.182 0.174 
$6.05 1.1834e-06  2.3669e-05 10000 0.050 105 0.084 6.53 6.98e+03 8.45e+05 0.174 0.150 
$6.06 1.1834e-06 2.3669e-05 100000 0.050 333 0.226 3.22 9.19e+04 4.88e+06 0.063 0.033 
$6.07 1.1834e-06 2.3669e-05 200000 0.050 388 0.258 2.91 1.40e+05 6.08e+06 0.050 0.026 
$6.08 1.1834e-06 2.3669e-05 250000 0.050 396 0.286 3.06 1.40e+05 6.30e+06 0.043 0.026 
$7.02 1.1834e-06  2.6299e-05 6591 0.045 84 0.066 12.56 5.73e+03 5.41e+05 0.178 0.196 
$7.03 1.1834e-06  2.9586e-05 6591 0.040 83 0.065 31.23 8.15e+03 4.61e+05 0.129 0.195 
S6_InsIC 1.1834e-06 2.3669e-05 6591 0.050 91 0.066 10.27 5.57e+03 6.56e+05 0.178 0.180 
S6_LorOff 1.1834e-06 2.3669e-05 6591 0.050 119 0.061 - 2.54e+04 6.92e+05 0.073 0.120 
S6-0 1.1834e-06  2.3669e-05 6591 0.050 100 0.076 6.12 6.00e+03 7.92e+05 0.192 0.160 


Tosc is the period of reversals determined from spectral analysis of the model dipole tilt time series. Tparker is the period of reversals determined by Equation (1) with y=200. Etor is the energy of the 
axially symmetric component of the toroidal flow. H is the modulus of the volume-integrated kinetic helicity of the non-axisymmetric flow in the northern hemisphere. Definitions of other parameters 
and diagnostics are provided in Methods. Presented diagnostics are averaged over the last three reversals or, in shorter simulations, during the second half of the reversing interval. 
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A cannabinoid link between mitochondria and 


memory 


Etienne Hebert-Chatelain!***, Tifany Desprez!*, Roman Serrat!**, Luigi Bellocchio! 


2, 1,2 


4% Edgar Soria-Gomez!?, 


Arnau Busquets-Garcia!’, Antonio Christian Pagano Zottola>?, Anna Delamarre!?, Astrid Cannich!’, Peggy Vincent!?, 

Marjorie Varilh!’, Laurie M. Robin!?, Geoffrey Terral!?, M. Dolores Garcia-Fernandez*®, Michelangelo Colavita!’, 

Wilfrid Mazier!, Filippo Drago’, Nagore Puente®’, Leire Reguero®?, Izaskun Elezgarai®°, Jean-William Dupuy”, Daniela Cota!?, 
Maria-Luz Lopez-Rodriguez", Gabriel Barreda-Gémez°, Federico Massa!*, Pedro Grandes®*-?, Giovanni Bénard!?g & 


Giovanni Marsicano!s 


Cellular activity in the brain depends on the high energetic support 
provided by mitochondria, the cell organelles which use energy 
sources to generate ATP!*, Acute cannabinoid intoxication induces 
amnesia in humans and animals”, and the activation of type-1 
cannabinoid receptors present at brain mitochondria membranes 
(mtCB,) can directly alter mitochondrial energetic activity’. 
Although the pathological impact of chronic mitochondrial 
dysfunctions in the brain is well established”, the involvement of 
acute modulation of mitochondrial activity in high brain functions, 
including learning and memory, is unknown. Here, we show that 
acute cannabinoid-induced memory impairment in mice requires 
activation of hippocampal mtCB, receptors. Genetic exclusion 
of CB, receptors from hippocampal mitochondria prevents 
cannabinoid-induced reduction of mitochondrial mobility, synaptic 
transmission and memory formation. mtCB, receptors signal 
through intra-mitochondrial Ga; protein activation and consequent 
inhibition of soluble-adenylyl cyclase (sAC). The resulting 
inhibition of protein kinase A (PKA)-dependent phosphorylation 
of specific subunits of the mitochondrial electron transport system 
eventually leads to decreased cellular respiration. Hippocampal 
inhibition of sAC activity or manipulation of intra-mitochondrial 
PKA signalling or phosphorylation of the Complex I subunit 
NDUEFS2 inhibit bioenergetic and amnesic effects of cannabinoids. 
Thus, the G protein-coupled mtCB, receptors regulate memory 
processes via modulation of mitochondrial energy metabolism. By 
directly linking mitochondrial activity to memory formation, these 
data reveal that bioenergetic processes are primary acute regulators 
of cognitive functions. 

Cannabinoids can activate CB, receptors localized at the plasma 
or mitochondrial membranes’. Exogenous cannabinoids induce 
amnesia™®, but the role of specific subcellular pools of CB, receptors 
in these effects is unknown. 

In silico subcellular protein localization analysis ”*" revealed that 
the deletion of the 22 N-terminal amino acids of the CB protein 
reduced its theoretical probability of mitochondrial targeting from 
40-45% to 1-3% (Extended Data Table 1). The CB,-receptor agonists 
WIN55,212-2 (WIN) and HU210 decreased respiration in CB,- 
transfected primary mouse fibroblasts that were deficient in CB; 
(also known as Cur1) or HEK 293 cells but not in cells transfected 
with a mutant cDNA of CB, which lacked the first 22 amino acids 


10,11 


(DN22-CB,; Fig. 1a and Extended Data Fig. 1a, b). DN22-CB, receptors 
were functional, because cannabinoid stimulation of CB,- and DN22- 
CB,-transfected cells induced phosphorylation of extracellular-signal- 
regulated kinases (ERKs) and activation of G proteins to an equal extent 
(Fig. 1b, c and Extended Data Fig. 1c). 

In neurons, distal energetic support requires mitochondrial 
mobility”, which is reduced by cannabinoids in enteric neurons!”. 
HU210 significantly reduced the percentage of mobile mitochondria 
in axons of CB,-, but not DN22-CB)-transfected hippocampal neurons 
from CB; ‘~ mice (Fig. 1d, Extended Data Fig. 1d and Supplementary 
Videos 1, 2). 

Semi-quantification of immunogold electron microscopy images® 
revealed that the injection of an adeno-associated virus expressing CB, 
(AAV-CB)) into the hippocampus of CB; _‘~ mice (denoted hereafter 
as CB,~'~(CB,) mice) induced the same proportion of mtCB, receptors 
as in control mice injected with an AAV expressing GFP (CB 1*'*(GEP) 
mice; Fig. le, f). CB,/~(DN22-CB,) mice expressed comparable 
amounts of total CB; protein as CB 1’ (CB,) mice (Extended Data 
Fig. 2a, b), but the proportion of mtCB, receptors was similar to nega- 
tive control CB,;~/~(GEP) littermates (Fig. le, f). °°S]GTP+ binding 
assays in hippocampal extracts of CB,'~(GEP), CB,~'~(CB,) and 
CB, '~(DN22-CB;) mice revealed that CB; and DN22-CB, receptors 
were functional ex vivo to an equal extent (Fig. 1g and Extended Data 
Fig. 2c-e). Thus, DN22-CB, is excluded from mitochondria without 
losing functionality, representing a useful tool to study mtCB, receptors 
in brain processes. 

CB,-receptor agonists reduce excitatory synaptic transmission in 
hippocampal slices from wild-type animals, but not from CB,~/~ 
mice!’, This effect was rescued in slices from CB, ~/~(CB,), but not 
from CB,~'~ (DN22-CB;) mice (Fig. 1h). Post-training cannabinoid 
administration induces amnesia in the novel object recognition 
(NOR) task'*. Injection of WIN impaired the NOR performance of 
CB,*/*(GEP) mice, but not of CB,’ (GEP) littermates, without alter- 
ing total exploration (Fig. 1i and Extended Data Fig. 2f). This effect 
was fully rescued in CB, ~'~(CB,), but not in CB, ~/~(DN22-CB;) mice 
(Fig. li and Extended Data Fig. 2f). 

We subsequently investigated intra-mitochondrial mtCB, receptor 
signalling. The cannabinoids A®-tetrahydrocannabinol (THC) and 
WIN decreased mitochondrial respiration in hippocampal cell cultures, 
reducing cellular and mitochondrial ATP content (Extended Data 
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Figure 1 | Requirement of mtCB, receptors for cannabinoid effects 
on mitochondrial respiration and mobility, synaptic transmission 
and memory. a, The effect of WIN (211M) on cellular respiration of 
primary mouse fibroblasts (MEFs) from CB,~/~ pups (postnatal day 
(P)0-P1) transfected by electroporation with mCherry (mock)-, CB,- or 
DN22-CB)-expressing vectors (1 = 3). b, The effect of HU210 (100 nM) 
on ERK phosphorylation in HEK293 cells transfected with mCherry- 
(n=8), CB,- (n=13) or DN22-CB)-expressing plasmids (n= 5). 
c, Dose-response curves of the effect of WIN on [*°S]GTP4S binding 
in membranes isolated from HEK293 cells transfected with mCherry- 
(n= 3-4), CB,- or DN22-CB)-expressing plasmids (n = 12-14). d, Effect 
of HU210 (11M) on the percentage of mobile mitochondria in axons from 
primary hippocampal neurons obtained from CB,~/~ pups (P0-P1) and 
transfected with vectors expressing GFP (n= 17), CB,-GFP (n= 32) or 
DN22-CB,-GFP receptors (n= 14). e, Representative electromicrographs 
(3 independent experiments) of immunogold staining of CB, receptors in 
the CA1 hippocampal region of CB;*/*(GFP), CB,~/~ (GFP), CB,;~'~(CBy), 
or CB, /~(DN22-CB,) mice. Scale bars, 0.5 zm. M, mitochondria; 
yellow arrows, plasma-membrane CB, receptors; red arrows, mtCB 
receptors; blue arrows, excluded immunogold particles that were located 
on mitochondrial membranes but at a distance <80 nm from other 
Fig. 3a, b). Mitochondrial G, proteins'*'° co-immunoprecipitated 
with CB), and THC induced their release (Fig. 2a, b and Extended 
Data Fig. 3c). The G,, inhibitor pertussis toxin (PTX) blocked the 
cannabinoid-induced decrease in mitochondrial cAMP, in protein 
kinase A (PKA) and complex I activity (Extended Data Fig. 3d-h) and 
in respiration (Fig. 2c, d). 

sAC produces mitochondrial cAMP, thereby regulating respiration 
through PKA-mediated phosphorylation of oxidative phosphorylation 
(OXPHOS)-associated proteins'”. As it was suggested that sAC acts 
independently of G proteins in the gonads!’, we tested the prevalence 
of G,-sAC interactions in brain mitochondria. In these organelles, sAC 
was detected along with other elements of the cAMP-PKA signalling 
cascade (Extended Data Fig. 3i) and it co-immunoprecipitated with G, 
(Fig. 2e and Extended Data Fig. 4a). THC increased this interaction 
(Fig. 2e and Extended Data Fig. 4a), suggesting that sAC might mediate 
the mitochondrial effects of mtCB, receptors. 
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CB,-(CB,) CB,“-(DN22-CB,) 


membranes””. f, Semi-quantification of the percentage of CB,-positive 
immunogold particles (mtCB,) in mitochondria (mito), normalized 

to the total number of gold particles (n = 3 mice, at least 20 images per 
mouse). g, Dose-response curves of the effect of HU210 on [*°S]GTP4S 
binding in membranes isolated from hippocampi of different mutant mice 
(CB,~!~(CB,), n= 8; CB,~/~ (DN22-CB,), n= 15; CB,;~'~(GEP), n = 22). 
h, The effect of HU210 (2.5,1M) on CA3-CA1 field excitatory post- 
synaptic potentials (fEPSP) slopes in hippocampal slices derived from 
CB,‘~(CB,) (n=5) and CB,~/~ (DN22-CB,) mice (n= 7). Left, plots 

of normalized fEPSP slopes with representative fEPSP traces before 

(1) and after HU210 treatment (2); right, histogram summarizing the 
average change in percentage of fEPSP slopes over baseline (dotted 

line). i, Cannabinoid-induced impairment of novel object recognition 
(NOR) memory requires mtCB)-receptor activation in the hippocampus. 
Discrimination index values in the object-recognition test of 
CB,*!*(GEP) (n= 16), CB,;~'~(GEP) (n=7), CB;~'~(CB,) (n= 13) and 
CB,~'~(DN22-CB,) (n = 15) mice treated with vehicle (white bars) or 
the CB, agonist WIN (5 mg kg“, intraperitoneal (i-p.), grey bars). Data, 
mean +s.e.m.; *P < 0.05; **P< 0.01; NS, not significant. For statistics, 
see Supplementary Tables 1-3. 


The activator of the non-mitochondrial membrane-bound AC, 
forskolin, had no effect on mitochondrial cAMP levels or on the 
decrease induced by the activation of mtCB, receptors (Extended 
Data Fig. 4b, c). Conversely, the sAC-activator bicarbonate (HCO; -)!8 
fully reversed the effect of mtCB,-receptor activation (Extended Data 
Fig. 4d, e) and abolished the cannabinoid-induced reduction in 
respiration (Fig. 2f, g). Overexpression of native or mitochondrial- 
enriched sAC protein!” (mtsAC; Fig. 2h and Extended Data Fig. 4g) in 
3T3 cells expressing mtCB, and its mitochondrial signalling machinery 
(Extended Data Fig. 4f) abolished the effect of THC on cellular respi- 
ration (Fig. 2i). This effect was also present in hippocampal cultures 
(Extended Data Fig. 4h, i). 

Next, we investigated whether sAC blockade prevents the effects of 
mtCB, receptors. The sAC inhibitor KH7 (refs 19, 20) slightly decreased 
respiration, but blocked the mtCB,-dependent decrease in the activity 
of PKA and complex I (Extended Data Fig. 5a-c), and in respiration 
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Figure 2 | Activation of mtCB, receptors affects mitochondrial 

activity via Gojjo proteins and sAC. a, Representative immunoblotting 

(3 independent experiments) of co-immunoprecipitation experiments of 
G, proteins with mtCB, receptors in purified brain mitochondria. 

b, Representative immunoblotting (3 independent experiments) showing 
the effect of A°-tetrahydrocannabinol (THC, 800nM) on mtCB, and 

G, proteins co-immunoprecipitation. c, The effect of pertussis toxin (PTX, 
1g ml~') on the decrease in respiration in purified brain mitochondria 
induced by THC (800 nM) and WIN (100 nM) (n= 4-8). d, The effect 

of PTX on the decrease in cellular respiration in hippocampal cultures 
induced by THC (11M) (n= 8-9). e, Representative immunoblotting 

(3 independent experiments) of co-immunoprecipitation experiments of 
G, proteins with sAC from purified brain mitochondria, after control or 
THC (800 nM) treatment. f, The effect of bicarbonate (HCO; ,5mM) 

on the decrease in respiration in purified brain mitochondria induced by 
THC (800 nM) and WIN (100 nM) (n=3-9). g, The effect of HCO; on 
the decrease in cellular respiration in primary hippocampal cultures 
induced by THC (11M) (n = 8-11). h, Representative immunoblot 

(3 independent experiments) of haemagglutinin (HA), complex III core 2 
(CIII core 2) and lactate dehydrogenase a (LDHa) showing the presence of 
sAC-HA in total cellular (TCL), cytosolic (Cyt.) and mitochondrial (Mito) 
fractions, whereas mtsAC-HA is specifically targeted to mitochondrial 
fractions of transfected 3T3 cells. i, Overexpression of sAC-HA or 
mtsAC-HA in 3T3 cells blocks the decrease in respiration induced by 
THC (800 nM) (n = 6-8). IP, immunoprecipitation; IB, immunoblot. Data, 
mean +s.e.m.; *P < 0.05; **P< 0.01; ***P< 0.001; NS, not significant. 
For statistics, see Supplementary Tables 1-3. 


in brain mitochondria (Fig. 3a). KH7 or overexpression of mtsAC 
abolished the cannabinoid effect on neuronal mitochondrial mobility 
(Fig. 3b, Extended Data Fig. 5d, e and Supplementary Videos 3, 4). 
In hippocampal slices, KH7 blocked the cannabinoid-induced reduc- 
tion of field excitatory post-synaptic potentials (fEPSPs) without 
altering synaptic transmission as such (Fig. 3c and Extended Data 
Fig. 5f, g). Eventually, the intrahippocampal infusion of KH7 abolished 
the amnesic effect of post-training WIN administration (Fig. 3d and 
Extended Data Fig. 5h). 

Phosphorylation-dependent modulation of OXPHOS proteins 
might, therefore, mediate the effects of mtCB, receptors. The PKA 
activator 8-bromoadenosine 3’,5’-cyclic monophosphate (8-Br-cAMP) 
reversed the cannabinoid-induced decrease in mitochondrial 
respiration, and the PKA-inhibitor H89 blocked the cannabinoid effects 
on the activity of mitochondrial PKA and complex I, and on respiration 
(Fig. 4a, b and Extended Data Fig. 6a—c). Silencing the A-kinase anchor 
protein AKAP121, which transports PKA to mitochondria”*”’, reduced 
intra-mitochondrial PKA levels and blocked the THC-induced effects 
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Figure 3 | Blockade of sAC activity prevents the effects of cannabinoids 
on mitochondrial respiration and mobility, synaptic transmission and 
memory. a, The effect of KH7 (541M) on the reduction in respiration 

in purified brain mitochondria induced by THC (800 nM) and WIN 

(100 nM) (n= 3-7). b, The effect of KH7 (5\1M) on the decrease in 
mitochondrial mobility in CB,-expressing neurons induced by HU210 
(1M) (n= 10-16). c, The effect of KH7 (101M) on the decrease in 
fEPSP at CA3-CA1 synapses in hippocampal slices from C57BL/6N mice 
induced by HU210 (2.5 1M) (n = 4-5). Left, plots of normalized fEPSP 
slopes with representative fEPSP traces before (1) and after (2) HU210 
incubation. Vehicle (n= 4) or KH7 (n=5) were pre-incubated 10 min 
before cannabinoid application. Right, histogram summarizing the 
average changes in percentage of fEPSP slope before (100% baseline, 
dotted line) and after HU210 treatment in presence or absence of KH7. 

d, Intrahippocampal KH7 blocks cannabinoid-induced amnesia in a NOR 
memory task. Discrimination index values in the NOR test showing the 
effect of vehicle or WIN (i.p., 5mg kg~!) in C57BL/6N mice that received 
intrahippocampal injection of vehicle or KH7 (21g in 0.5 pl). n= 7-14 per 
group. Data, mean + s.e.m.; *P < 0.05; **P< 0.01; ***P < 0.001; NS, not 
significant. For statistics, see Supplementary Tables 1-3. 


on complex I activity and respiration (Fig. 4c, d and Extended Data 
Fig. 6d-i). Systemic injection of THC reduced the PKA-dependent 
phosphorylation of brain mitochondrial proteins of wild-type, but not 
CB,~'~ mice, and the intracerebroventricular injection of KH7 blocked 
this effect (Fig. 4e, f). These results indicate that mtCB, receptors 
modulate brain mitochondrial respiration through the PKA-dependent 
phosphorylation of mitochondrial proteins. 

By using selective substrates, we observed a specific effect of THC 
on complex-I-dependent respiration in brain mitochondria (Extended 
Data Fig. 7a), suggesting that mtCB, receptor signalling specifically 
targets this complex. Analysis of the PKA-dependent phosphorylation 
of brain mitochondrial proteins revealed that the subunit NDUFS2 
was consistently targeted by PKA in an mtCB,-signalling-dependent 
manner (Fig. 4g—j and Extended Data Fig. 7b). 

To study the functional role of the PKA-NDUF32 signalling cascade, 
we first fused a constitutively active mutant form of PKA (PKA-CA)** 
to a mitochondrial leading sequence (MLS), inducing the mitochon- 
drial localization of the mutant protein (MLS-PKA-CA; Extended Data 
Fig. 7c). Expression of MLS-PKA-CA inhibited the WIN-induced respi- 
ration decrease both in CB,-co-transfected HEK293 cells (Extended 
Data Fig. 7d, e) and in hippocampal extracts from wild-type mice that 
received hippocampal injection of an AAV expressing the mutant 
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Figure 4 | Activation of mtCB, decreases PKA-dependent 
phosphorylation of mitochondrial proteins. a, The effect of 8-Br-cAMP 
(500 1M) on the reduction in respiration in purified brain mitochondria 
induced by THC (800 nM) and WIN (100 nM) (n =3-7). b, The effects 

of H89 (800 nM) on the reduction in respiration in purified brain 
mitochondria induced by THC (800 nM) and WIN (100 nM) (n=3-7). 

c, d, Silencing of AKAP121 (using small hairpin RNA, (sh)AKAP121) 
lowers mitochondrial PKA and blocks the effect of mtCB-receptor 
activation. c, Representative immunoblots (3 independent experiments) of 
AKAP121, PKA, CIII core 2 and LDHa in total cellular, cytoplasmic and 
mitochondrial lysates from 3T3 cells treated with shAKAP121 or control. 
d, The effect of AKAP121 silencing on the reduction in cellular respiration 
in 3T3 cells induced by THC (800 nM) (n= 4). e, f, Activation of mtCB, 
decreases PKA-dependent phosphorylation of brain mitochondrial 
proteins. e, Representative immunoblots (3 independent experiments) 
showing the effect of THC (i-p., 10mg kg~') on PKA-dependent 
phosphorylation levels of brain mitochondrial proteins (n = 3). TOM20, 
translocase of outer membrane subunit 20. f, Representative immunoblots 
(3 independent experiments) showing the effect of centrally-injected 

KH7 (intracerebroventricular injection (i.c.v.), 5 nmol) on the decrease 


protein (AAV-MLS-PKA-CA,; Fig. 4k and Extended Data Fig. 7f). The 
hippocampal expression of MLS-PKA-CA did not alter mouse memory 
in vivo, but it blocked the amnesic effect of WIN (Fig. 41). We then 
generated a mutant form of NDUFS2, carrying point mutations at con- 
sensus sites for PKA-dependent phosphorylation”*”®, thereby providing 
a phosphomimetic NDUFS2 protein (NDUFS2-PM). NDUFS2-PM 
expression in CB,-co-transftected HEK293 cells did not change cellular 
oxygen consumption, but it blocked the effect of WIN on cellular respi- 
ration (Extended Data Fig. 7c-e). The viral expression of NDUFS2-PM 
in mouse hippocampi (Extended Data Fig. 7f) inhibited the effect of 
WIN on respiration in hippocampal homogenates and substantially 
reduced the amnesic effect of the cannabinoid agonist (Fig. 4k, 1). 
These results show a previously unknown link between acute brain 
mitochondrial activity and memory, in which direct bioenergetic 
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in PKA-dependent phosphorylation of brain mitochondrial proteins 
induced by systemic THC injection (i.p., 10 mg kg~'). SDHA, succinate 
dehydrogenase subunit A. g-j, Activation of mtCBj receptors decreases 
PKA-dependent phosphorylation of the complex I subunit NDUFS2. 

g, h, Representative immunoblotting (g; 3 independent experiments) and 
quantification (h; n =5) of PKA-dependent phosphorylation and total 
NDUEFS2 of immunoprecipitated complex I proteins from purified brain 
mitochondria treated with vehicle or THC (800 nM). i, j, Representative 
immunoblotting (i, 3 independent experiments) and quantification 

(j, 1 =3-7) showing the effects of HCO3° (5 mM) and 8-Br-cAMP 

(500 1M) on the decrease in PKA-dependent phosphorylation of NDUFS2 
in brain mitochondria induced by THC (800 nM). k, The effect of 

WIN (100 nM) on respiration of homogenized hippocampi from mice 
that received intrahippocampal injection of AAV-MLS-PKA-CA or 
AAV-NDUFS2-PM (n=6). 1, The effect of WIN (5 mg kg!) on NOR 
discrimination index of mice intrahippocampally injected with 
AAV-MLS-PKA-CA or AAV-NDUFS2-PM (n= 8-11). Data, 
mean+s.e.m.; *P < 0.05; ** P< 0,01; ***P < 0.001; NS; not significant; 
+P <0.05 compared to vehicle; +P < 0.05; $£$P < 0.001 compared to 
AAV-GFP WIN. For statistics, see Supplementary Tables 1-3. 


effects of cannabinoids play a central role. The activation of intra- 
mitochondrial Ga; proteins by brain mtCB, receptors triggers inhi- 
bition of sAC activity, a reduction in cAMP levels and a decrease in 
PKA-dependent phosphorylation of OXPHOS proteins, among which 
the complex I subunit NDUFS2 plays a notable functional role. This 
cascade of events leads to a decrease in brain mitochondrial activity, 
which is required for the acute effects of cannabinoids on mitochon- 
drial mobility, synaptic depression and eventually amnesia. 
Mitochondria provide several necessary substrates for cells, having 
a particularly limiting role in brain functions!*. Whereas long- 
term mitochondrial alterations participate in the pathogenesis of 
neurological and neuropsychiatric diseases’ and can alter memory 
processes””8, recent evidence shows that even very short interrup- 
tions of the ATP supply exert marked effects on synaptic activity’. 
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The mtCB,-dependent decrease in OXPHOS functions and ATP 
production represents a likely mechanism connecting mitochondrial 
functions and cannabinoid-induced synaptic and amnesic effects. More 
indirectly, recent work showed that the intracellular mTORC1 path- 
way participates in the cannabinoid-induced amnesic effects'*. The 
brain mTORC1 complex bidirectionally interacts with mitochondria”’, 
suggesting that a common mitochondria~-mTORC1 pathway might 
be involved in cannabinoid amnesic effects. Recent data revealed a 
biphasic effect of CB) receptor agonists in the hypothalamus, with low 
doses increasing and high doses decreasing respiration, respectively”. 
Thus, brain-region-specific or cell-type-specific differences in the sig- 
nalling of mtCB, receptors might exist. 

This study demonstrates that at least one G-protein-coupled 
receptor present on mitochondrial membranes modulates high brain 
functions such as memory formation through the modulation of intra- 
mitochondrial G-protein signalling. Considering that G proteins play a 
central role in the brain, the present data will probably pave the way for 
anew field of research that deals with the acute effects of mitochondrial 
activity on brain functioning. 

Cannabinoid drugs have several therapeutic potentials”, unfor- 
tunately limited by important side effects, such as impairment of 
memory”®. The present data suggest that selective targeting of specific 
subcellular populations of CB, receptors in the brain might assist in 
development of safer therapeutics against several brain disorders. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Mice. Experiments were approved by the local ethical committee of the University 
of Bordeaux (approval number 501350-A) and the French Ministry of Agriculture 
and Forestry (authorization number 3306369). Mice were maintained under 
standard conditions (food and water ad libitum; 12 h-12h light-dark cycle, light 
on at 7:00; experiments were performed between 9:00 and 17:00). Male C57BL/6N 
mice were purchased from Janvier (France). Wild-type (CB,*/*) and CB; ~/~ female 
and male mice (2-4 months old) were obtained, bred and genotyped as described*!. 
Only male mice were used for behavioural experiments. For most experiments 
CB,*/* and CB,~/~ were littermates. For primary cell cultures, pups were obtained 
from homozygote pairs. No method of randomization to assign experimental 
groups was performed and the number of mice in each experimental group was 
similar. No statistical methods were used to predetermine sample size. 

Drugs. THC was obtained from THC Pharm GmbH (Frankfurt, Germany). 
HU210 was synthesized as described**. WIN55-212-2, KH7, PTX, bicarbonate 
(HCO3_), forskolin, carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone 
(FCCP), oligomycin, antimycin, rotenone, picrotoxin, GTPS and other chemicals 
used in this study were purchased from Sigma-Aldrich (St-Louis, USA). [7H] 
CP55,940 (162.5 Cimmol~') and [*°S]GTP*S (1,250 Cimmol~!) were purchased 
from Perkin Elmer NEN (Boston, USA). 

For in vivo administration, WIN was dissolved in a mixture of saline (0.9% 
NaCl) with 2% DMSO and 2% cremophor; THC was dissolved in a mixture of 
4% ethanol, 5% cremophor and saline; and KH7 was dissolved in 10% cremophor, 
2.5% DMSO and saline. Vehicles contained the same amounts of solvents. All drugs 
were prepared freshly before the experiments. 

For in vitro experiments, PTX, HCO3~ and forskolin were dissolved in water. 
KH7, HU210 and WIN were dissolved in DMSO. THC, oligomycin, FCCP, anti- 
mycin and rotenone were dissolved in ethanol. Corresponding vehicle solutions 
were used in control experiments. DMSO and ethanol were no more than 0.001%. 

Doses and concentrations of the different drugs were chosen on the basis of 

previous published data or preliminary experiments. 
Plasmids. The N-terminal deletion of the first 22 amino acids (66 base pairs) in the 
mouse CB)-receptor coding sequence, to obtain the DN22-CB, mutant, as well as 
the generation of the mitochondrially targeted constitutively active form of PKA 
(MLS-PKA-CA) was achieved by polymerase chain reaction (PCR). 

In brief, for DN22-CB, a forward primer hybridizing from the 67th base starting 
from ATG was coupled to a reverse primer hybridizing to the end of the coding 
sequence, including the TGA stop codon. In order to guarantee accurate transla- 
tion of the construct, the forward primer included an ATG codon upstream of the 
hybridizing sequence. The cDNA for DN22-CB, was amplified using HF Platinum 
DNA polymerase (Invitrogen) and inserted into a PCRII-Topo vector (Invitrogen) 
according to the manufacturers’ instructions. The absence of amplification mis- 
matches was then verified by DNA sequencing. Primers used were: forward, with 
the inserted ATG in bold, 5/-ATGGTGGGCTCAAATGACATTCAG-3’; reverse, 
with the stop codon in bold, 5’-TCACAGAGCCTCGGCAGACGTG-3’. The 
cDNA sequence for CB, or DN22-CB, was inserted into a modified version of 
a pcDNA3.1 mammalian expression vector using BamHI-EcoRV according to 
standard cloning procedures. This modification allowed the co-expression of CB, 
or DN22-CB, with an mCherry fluorescent protein for control of transfection 
efficiency. For the study of mitochondrial motility, the coding sequence of CB, or 
DN22-CB, was fused to GFP using the pEGFP-N1 vector (Addgene) according to 
the manufacturer's instructions. 

For MLS-PKA-CA, a forward primer including a restriction site after the 
initial ATG codon for future subcloning with mitochondrial leading sequences 
was coupled to a reverse primer hybridized to the end of the coding sequence 
of the catalytic subunit of PKA (pET15b PKA Cat from Addgene)*, including a 
myc epitope and a TGA stop codon. Subsequently, the construct was subcloned 
into a pcDNA3.1 vector as an intermediate step and the QuikChange Multi Site- 
Directed Mutagenesis Kit (Agilent Genomics, Santa Clara, CA, USA) was used 
to mutate histidine-87 to glutamine, and tryptophan-196 to arginine to gener- 
ate a constitutively active form of PKA™. Finally, the construct was fused to a 
4x MLS sequence to target the constitutively active PKA to mitochondria (MLS- 
PKA-CA). The absence of amplification mismatches and confirmation of mutagen- 
esis was then verified by DNA sequencing. Primers used were: forward, with the 
inserted ATG in bold, 5’-TATCTGGATCCCTATGCAATTGGGCAACGCCG 
CCGCCGCCAAGAAGG-3’; reverse, with the stop codon and the myc epitope 
in bold, 5’-TATGATCTAGAGATCACAGATCCTCTTCTGAGATGAGTTTT 
TGTTCAAACTCAGTAAACTCCTTGCCACACTTC-3’; and for mutagenesis 
of H87, 5’-AAAGCAGATCGAGCAAACTCTGAATGAGAAG-3’; and W196 
5'-GITGAAAGGCCGTACTAGGACCTTGTGTGGGA-3’ (in bold are the mutated 
codons). 

The phosphomimetic version of NDUFS2 was custom synthesized by Eurofins 
Genomics (Germany). Briefly, the NDUFS2 sequence (NM_153064) was modified 


to obtain a phosphomimetic form mutating the 3 potential phosphorylation sites 
of PKA. The sites were chosen because consensus for their PKA phosphorylation 
nature was found between the two online available phosphorylation prediction 
algorithms, PhosphoMotif Finder (http://www.hprd.org/PhosphoMotif_finder) 
and PKA prediction site (http://mendel.imp.ac.at/pkaPS/)*>”°. By this approach, 
four sites were identified. One of these was excluded, because it is present on the 
mitochondrial leading sequence of NDUFS2. Thus, serines 296, 349 and 374 
were mutated to aspartic acid to obtain a phosphomimetic version of NDUFS2 
(NDUFS2-PM). A myc epitope was added at the C terminus of the protein for 
detection. 

Preparation of adeno-associated viruses (AAV). The cDNAs coding for mouse 
CB,, DN22-CB,, MLS-PKA-CA, NDUFS2-PM and for GFP were subcloned into 
the pAM-CBA vector using standard molecular cloning techniques. The resulting 
vectors were transfected by calcium phosphate precipitation into HEK293 cells 
together with the rAAV-helper-plasmid pFd6 and AAV1/2-serotype-packaging 
plasmids pRV1 and pH21 (ref. 34.). The viruses were then purified and titred as 
previously described*>*, Virus titres were between 10'° and 10"! genomic copies 
per ml for all batches of virus used in the study. 

Cell culture and transfection. All cell lines were originally obtained from ATCC 
(https://www.lgcstandards-atcc.org/Products/Cells_and_Microorganisms/Cell_ 
Lines.aspx?geo_country=fr). Mouse 3T3 cells (3T3 F442A), HeLa and HEK293 
cells were grown in Dulbecco modified Eagle's medium (DMEM) supplemented 
with 10% fetal bovine serum (FBS), 4.5 g [ glucose, 2mM glutamine, 1mM 
pyruvate. HEK293 cells were transfected with control plasmid, CB, or DN22-CB; 
cDNA coupled with mCherry cloned in pcDNA 3.1(+), respectively. Cells were 
transfected with sAC-HA or mtsAC-HA provided by G. Manfredi, (see refs 17, 21) 
or small hairpin RNA (shRNA) targeting AKAP121 provided by A. Feliciello 
(see ref. 22). HeLa cells were transfected with MLS-PKA-CA or NDUFS2-PM 
(see above). The transfections were carried out using FugeneHD (Roche, France) 
for 3T3 cells and polyethylenimine (PEI, Polysciences, USA) for HEK293 and HeLa 
cells, according to the manufacturers’ protocols. 

For biochemical experiments, primary hippocampal cultures were prepared 
from CB,*/*+ and CB,~/~ PO-P1 mice. Briefly, after mice were killed by 
decapitation, hippocampi were extracted in dissection medium (10 mM HEPES, 
0.3% glucose in Hank’s balanced salt solution, pH 7.4) and dissociated in 0.25% 
trypsin for 30 min. Where indicated, dissociated cells were transfected with 
sAC-HA using the Amaxa P3 primary cell 4D-nucleofector kit (Lonza, France), 
according to the manufacturer's protocol. Cells were plated on poly-.-lysine-coated 
96-well dishes using neurobasal/B27 medium (supplemented with 5% FBS, 2mM 
L-glutamine, 1mM pyruvate, 1 mM sodium lactate, 0.3% glucose and 37.5mM 
NaCl) at a density of 50,000 cells per well. One hour after plating, the serum was 
removed. Primary hippocampal cultures contained both neurons and astrocytes, 
and were used at 3 days in vitro (DIV). 

For live imaging of mitochondrial mobility, primary hippocampal cultures 
were prepared from CB,~/~ PO-P1 mice. Brains were extracted in PBS containing 
0.6% glucose and 0.5% bovine serum albumin (BSA) and the hippocampi were 
dissected. To dissociate cells, a kit for dissociation of postnatal neurons was used 
following the manufacturer's instructions (Milteny Biotec, France). Cells were 
seeded onto 0.5 mg ml! poly-t-lysine-coated 35-mm glass-bottom dishes (MatTek 
Corporation, France) for live imaging in neurobasal medium (Gibco, France) 
containing 2mM t-glutamine, 120g ml“! penicillin, 200 1g ml“! streptomycin 
and B27 supplement (Invitrogen, France), and were maintained at 37°C in the 
presence of 5% COy. Cells were cultured for 7 to 9 days. Neuron transfection was 
carried out at 4-5 DIV, using a standard calcium phosphate transfection protocol, 
with a 1:2 DNA ratio of plasmids expressing pDsred2-mito*” to GFP, CB,-GFP or 
DN22CB;-GFP, respectively. Axonal mitochondrial mobility was recorded 72-96 h 
after transfection (see below). Cannabinoid treatments altered the percentage of 
axonal mobile mitochondria, without altering velocity, dwelling time or travelled 
distance (data not shown). 

Mouse fibroblasts were generated from PO-P1 CB, /~ pups. After mice were 
killed by decapitation, the dorsal skin was excised and minced in PBS. Cells 
were then separated by incubation in 0.25% trypsin solution in PBS, collected 
by centrifugation and resuspended in DMEM with 10% fetal bovine serum, 
1% L-glutamine and 2% penicillin/streptomycin solution (Invitrogen, France). Cells 
were seeded in 25-cm? flasks and then expanded in 75-cm? flasks until reaching 
90% confluence. Transfections were carried out by using a BT'X-electroporator 
ECM 830 (Harvard Apparatus, France) (175 V, 1-ms pulse, five pulses, 0.5-s interval 
between pulses). Cells were electroporated in Optimem medium (Invitrogen, 
France) at 2 x 10’ cells per ml (fibroblasts from two 75 cm? flasks at 90% 
confluence in 300 il) in a 2-mm gap cuvette using 30 1g of either control plasmid 
(mCherry), CB; or DN22-CB,; cDNA coupled with mCherry, respectively. After 
electroporation, cells were resuspended in DMEM with 10% fetal bovine serum, 
1% L-glutamine and 2% penicillin/streptomycin solution (Invitrogen, France) and 
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seeded in three 100 cm? Petri dishes. All cells were maintained at 37°C and 5% CO} 
and collected 48h after transfection for respiration experiments. 

Isolation of mitochondria and cell fractionation. The brains of CB;*’* and 
CB,~'~ littermates were dissected and mitochondria were purified using a Ficoll 
gradient as previously described”*. In brief, brains were extracted in ice-cold 
isolation buffer (250 mM sucrose, 10mM Tris, 1 mM EDTA, pH 7.6) containing 
protease inhibitors (Roche, France) and 2M NaF and homogenized with a Teflon 
potter. Homogenates were centrifuged at 1,500g for 5 min (4°C). The supernatant 
was then centrifuged at 12,500g (4°C). The pellet was collected and the cycle of cen- 
trifugation was repeated. To purify mitochondria, the final pellet was resuspended 
in 4001 of isolation buffer, layered on top of a discontinuous Ficoll gradient (10% 
and 7% fractions) and centrifuged at 100,000g for 1h (4°C). Purified mitochondria 
were recovered from the pellet obtained after ultracentrifugation. All experiments 
using freshly isolated brain mitochondria were performed within 3h after puri- 
fication. 

The 3T3 cells were collected, resuspended in isolation buffer and disrupted 

with 25 strokes using a 25G needle. The total cell lysate was centrifuged at 500g 
(4°C) to remove cells debris and nuclei. The supernatant was kept and centrifuged 
at 12,500g for 10 min (4°C). The supernatant was then kept (cytosolic fraction), 
the pellet was resuspended, and the centrifugation cycle was repeated. Finally, the 
mitochondrial fractions were obtained from the last pellet. 
Oxygen consumption measurements. The oxygen consumption of isolated 
mitochondria, homogenized hippocampus and cell lines was monitored at 37 °C 
in a glass chamber equipped with a Clark oxygen electrode (Hansatech, UK). 
Purified mitochondria (75-100 1g) were suspended in 500 1] of respiration buffer 
(75mM mannitol, 25mM sucrose, 10mM KCl, 10mM Tris-HCl pH 7.4, 50 mM 
EDTA) in the chamber. Respiratory substrates were added directly to the chamber. 
Pyruvate (5 mM), malate (2 mM) and ADP (5mM) were successively added to 
measure complex-I-dependent mitochondrial respiration. Complex-II-dependent 
respiration was measured using rotenone (0.51M), succinate (10 mM) and ADP 
(5mM). Complex-IV-dependent respiration was measured using N,N,N‘N’/- 
tetramethyl-p-phenylenediamine (TMPD, 0.5 mM) and ascorbate (2 mM), in the 
presence of ADP (5mM) and antimycin A (2.5|1M). Complex-I-dependent respi- 
ration was evaluated, unless stated otherwise. 

For respiration in homogenized hippocampi, both hippocampi of each mouse 
were dissected and homogenized with a Teflon potter in 800 11 Mir05 buffer 
(Mitochondrial Physiology Network: 0.5mM EGTA, 3mM MgCl, 60mM 
lactobionate, 20 mM taurine, 10 mM KH,PO,, 20mM HEPES, 110mM sucrose 
and 1 g1~' BSA) containing 12.5 1g ml! of saponin. Subsequently, 15 11 of the 
homogenate was diluted in 1 ml Mir05 buffer and the oxygen consumption was 
measured with the respiratory substrates pyruvate (5 mM), malate (2mM) and ADP 
(5mM) to measure complex-I-dependent mitochondrial respiration before and 
after WIN (100nM) or vehicle addition. Oligomycin (21g ml~!), FCCP (0.541M), 
rotenone (0.5,1M) and antimycin A (2.5,1M) were injected subsequently into the 
chamber as modifiers of the respiration. 5011 of the homogenate were saved for 
WBand protein quantification experiments. 

The experiments using cell lines were performed on 2 x 10° cells ml“! in 
growth medium. Intact cells were transferred directly into the chamber and 
basal respiration was recorded. Drugs were added directly into the chambers. 
Mitochondria were incubated with PTX, KH7 and H89 for 5 min before addition 
of CB, agonists. HCO; and 8-Br-cAMP were added 5 min after the addition of 
CB, agonists. 

Oxygen consumption of primary hippocampal cultures was monitored using an 
XF96 Seahorse Bioscience analyser (Seahorse Bioscience, Denmark), according to 
the manufacturer’s protocol. When indicated, oligomycin (2,.g ml!) and FCCP 
(1M) were injected directly into the wells. Other drugs were directly added into 
the medium 1h before measurements. 

Respiration of HEK293 cells co-expressing CB, and NDUFS2-PM or MLS- 
PKA-CA was analysed using the Oxygraph-2k (Oroboros Instruments, Austria). 
These experiments were performed on 5 x 10° cellsml~! in growth medium. WIN 
was directly added into the medium 30 min before measurements. Then, intact 
cells were transferred into the 2-ml chamber and basal respiration was recorded. 
NADH-ubiquinone oxidoreductase (complex I) activity assay. NADH 
oxidation into NAD* by the first complex of the respiratory chain is coupled to the 
reduction of ubiquinone (coenzyme Q). The rate of this reaction is analysed by the 
measurement of NADH disappearance, which is spectrophotometrically detected 
(SAFAS, UVmc2) at 340 nm. The NADH extinction coefficient is 6.22mM~!cm7!. 
Final composition of the reaction solution was 50 mM K,HPO, pH 7.2, 2.5mg ml! 
BSA, 0.1mM ubiquinone and 200 ,g total cell extract proteins or 50\1g purified 
brain mitochondria. The reaction was initiated by adding 0.1 mM NADH. The 
assay was monitored at 37 °C for 5 min. 

ATP content measurements. The intracellular ATP content was measured using 
the bioluminescent ATP kit HS II (Roche, France). CB;*/* and CB,~/~ primary 
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hippocampal cultures (50,000 cells per well in a 96-well dish) were treated with 
THC (11M), WIN (11M) or vehicle in the presence or absence of rotenone (0.1 |1.M) 
for 1h. Then, ATP measurements were performed as previously described**. 
In brief, cells were lysed to release the intracellular ATP using the lysis buffer 
provided with the kit (equal volume) for 20 min. The lysate was then analysed 
in a 96-well plate luminometer (Luminoskan, Thermo Scientific, France) using 
the luciferine/luciferase reaction system provided with the kit. For this, 100 1l 
of luciferine/luciferase was injected in the wells and after 10s of incubation, 
bioluminescence was read (1s integration time). Standardizations were performed 
with known quantities of standard ATP provided with the kit. The ATP content 
derived from mitochondria was determined by subtracting ATPyotenone Values from 
the ATProtals (ATPmnito = ATPrtotal — ATPyotenone)+ 

Trypsin-sensitivity assays. 100\1g of mitochondria were suspended in isolation 
buffer, untreated or incubated with 0.01% trypsin in the presence or absence of 
0.05% triton X-100 for 15 min at 37°C. Proteins were then processed for western 
immunoblot analyses. 

Immunoprecipitation assays. Freshly purified brain mitochondria were resus- 
pended in PBS (5mg ml!) supplemented with protease inhibitor cocktail (Roche, 
France) and 2mM Naf, and solubilized with 1% lauryl maltoside for 30 min (4°C). 
For co-immunoprecipitation of sAC and G,, mitochondria were incubated with 
THC (800 nM) or vehicle for 5 min at 37°C. Proteins were incubated with a 
C-terminal anti-CB, antibody (Cayman, USA) or sAC R21 antibody (CEP Biotech, 
USA) overnight (4°C). For immunoprecipitation of complex I, mitochondrial 
proteins were treated with THC (800nM), HCO; (5mM), 8-Br-cAMP (500M) 
or vehicle for 5 min at 37°C and then incubated with complex-I-agarose-conjugated 
beads (Abcam, UK). Protein A/G PLUS-agarose beads (Santa Cruz, USA) were 
then added and the incubation continued for 4h (4°C). The elution was performed 
using glycine buffer (0.2 M glycine, 0.05% lauryl maltoside pH 2.5) and samples 
were processed for western immunoblotting. 

ERK phosphorylation assays. Following transfection (mCherry, CB, or 
DN22-CB,, respectively), cells were allowed to recover in serum containing 
medium for 24h. Cells were then starved overnight in serum-free DMEM before 
treatment and lysis. The cells were then treated at 37°C with HU210 (100 nM) 
or vehicle for 10 min. The medium was rapidly aspirated and the samples were 
snap-frozen in liquid nitrogen and stored at —80°C before preparation for western 
blotting. 

Protein extraction and western immunoblotting. For ERK-phosphorylation 
assays, lysis buffer (1 mM EGTA, 50 mM Nak, 1mM Na3VOq, 50 mM Tris pH 7.5, 
1% triton X-100, protease inhibitors, 30 mM 2-mercaptoethanol) was added and 
the cells were collected by scraping and pelleted by centrifugation at 12,500g (4°C) 
for 5 min to remove cell debris. Protein concentrations were measured using the 
Pierce BCA protein assay kit (Thermo Scientific), loaded with Laemmli buffer 
and kept at —80°C. 

For western immunoblotting, the proteins were separated on Tris-glycine 7%, 
10% or 12% acrylamide gels and transferred to PVDF membranes. Membranes 
were soaked in 5% milk (5% BSA for phosphorylation immunoblots) in tris- 
buffered saline (TBS; Tris 19.82 mM, NaCl 151 mM, pH 7.6) containing tween20 
(0.05%). Mitochondrial proteins were immunodetected using antibodies against 
complex III core 2 (Abcam, ab14745; 1:1,000, 1h, room temperature), succinate 
dehydrogenase subunit A (Abcam, ab14715; 1:10,000, 1h, room temperature), 
NDUFA9 (Abcam, ab14713; 1:1,000, 1h, room temperature), NDUFS2 (Abcam, 
ab110249; 1:1,000, 1h, room temperature) and TOM20 (Santa Cruz, sz-11415; 
1:1,000, 1h, 4°C). Cytosolic proteins were probed with LDHa (Santa Cruz, 
sz-137243; 1:500, overnight, 4°C). Samples were also probed with antibodies 
against G, proteins (Enzo Life Science, SA-126; 1:1,000, 1h, room temperature), 
sAC (CEP Biotech, sAC R21; 1:500, overnight, 4°C), PKA (cAMP protein 
kinase catalytic subunit, Abcam, ab76238; 1:1,000, 1h, room temperature), an 
antiserum directed against the C terminus of CB, receptor (Cayman, 10006590; 
1:200, overnight, 4°C), AKAP121 (from A. Feliciello; 1:1,000, overnight, 4°C), 
PKA-dependent phosphorylation sites (phospho (Ser/Thr)-PKA substrate, 
Cell Signaling, 9621; 1:1,000, overnight, 4°C) and HA (Abcam, ab18181; 1:500, 
overnight, 4°C), p-ERK (phospho-p44/42 MAPK) corresponding to residues 
around Thr202/Tyr204 (Cell Signaling, 4370; 1:1,000, overnight, 4°C), ERK (p44/ 
p42 MAPK; Cell Signaling, 9102; 1:2,000, 1h, room temperature). 

Mitochondrial proteins were also separated by two-dimension electrophoresis as 
described*’. Purified brain mitochondria were solubilized (10 mg ml~!) in 0.75M 
aminocaproic acid, 50 mM BisTris, (pH 7.0) with 1.5% n-dodecyl-maltoside for 
30 min on ice, and were then centrifuged at 16,000g (4°C). The supernatant was 
collected and supplemented with 0.25% coomassie blue G and protease inhibitors 
(Roche, France). Proteins were then separated with 4-16% gradient native-PAGE 
gels (Invitrogen, France). The different lanes were cut out and processed for the 
second dimension on 12.5% SDS-PAGE gels after denaturation and reduction in 
1% (w/v) sodium dodecyl sulphate and 1% (v/v) mercaptoethanol. The second 
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dimension gels were immunoblotted for detection of PKA-dependent phosphoryl- 
ated proteins. A second-dimension gel was kept for coomassie blue staining. Then, 
membranes were washed and incubated with appropriate secondary horseradish 
peroxidase (HRP)-coupled antibodies (1h, room temperature). Finally, the HRP 
signal was detected using the ECL-plus reagent (Amersham) and the Bio-Rad 
Quantity One system. Labelling was quantified by densitometric analysis using 
Image] (NIH) software. 

Immunocytochemistry. HeLa cells were fixed in 4% formaldehyde dissolved 
in PBS (0.1 M, pH 7.4) and then washed with PBS. Cells were pre-incubated 
in a blocking solution of 10% normal goat serum, 0.1% triton X-100, 0.05% 
deoxycholate and 0.2 M glycine prepared in PBS for 1h and then incubated with 
primary antibody rabbit anti-TOM20 (Santa Cruz, sc-11415; 1:500) and mouse 
anti-myc (Roche, 11667149001; 1:500) for 2h in the same blocking solution. The 
cells were then washed in PBS for 1h and were then incubated with fluorescent 
anti-mouse Alexa488 or anti-rabbit Alexa561 (Jackson ImmunoResearch; 
1:800) in blocking solution for 1h. Finally, cells were washed and mounted with 
fluoromont-G (Electron Microscopy Sciences). All the procedures were carried 
out at room temperature. The cells were analysed with a Confocal Leica DM16000 
microscope (Leica). 

Proteomic analyses. Samples were digested by trypsin as previously described“. 
Peptides were further analysed by nano-liquid chromatography coupled to a MS/ 
MS LTQ-Orbitrap XL mass spectrometer (Thermo Fisher Scientific, Germany). 
Peptides were identified with SEQUEST and MASCOT algorithms through the 
Proteome Discoverer interface (Thermo Fisher Scientific, Germany) against 
a subset of the UniProt database restricted to Reference Proteome Set of Mus 
musculus (UniProtKB Release 2011_12, 14'* December, 2011, 46,638 entries). 
Peptide validation was performed using Percolator algorithm“! and only ‘high 
confidence’ peptides were retained corresponding to a 1% false positive rate at 
peptide level. 

PKA activity and cAMP content assays. Cyclic AMP levels and PKA activity of 
mitochondria isolated from the brain were assayed using the Direct Correlate- 
EIA cAMP kit (Assay Designs Inc., USA) and an ELISA kit (Enzo Life Science), 
respectively, according to the manufacturers’ instructions. The different treatments 
described in the main text were performed at 37°C for Lh. 

Imaging and quantification of axonal mobility of mitochondria. Mitochondrial 
mobility in hippocampal neurons was recorded using an inverted Leica DMI6000 
microscope (Leica Microsystems, Wetzlar, Germany) equipped with a confocal 
head Yokogawa CSU-X1 (Yokogawa Electric Corporation, Tokyo, Japan) and a 
sensitive Quantem camera (Photometrics, Tucson, USA). The diode lasers used 
were at 491 nm and 561 nm and the objective was a HCX PL APO CS 63 x oil 1.32 
NA lens. The z stacks were obtained with a piezo P721.LLQ (Physik Instrumente 
(PI), Karlsruhe, Germany). The 37°C atmosphere during time-lapse image 
acquisition was created with an incubator box and an air heating system (Life 
Imaging Services, Basel, Switzerland) in the presence of 5% CO . This system was 
controlled by MetaMorph software (Molecular Devices, Sunnyvale, USA). 

For mitochondrial axonal transport analysis, time-lapse series of image stacks 
composed of 6 images (512 x 512 pixels) were taken every 3s for 15 min. HU210 
was added just after the recording and 15 min later the same neuron was recorded 
for another 15 min. KH7 or vehicle were added 15 min before the first recording. 
All stacks obtained were processed first with MetaMorph software. Further image 
processing, analysis and video compilation (28 frames per second) and editing was 
done with Image] software (NIH, USA). Kymographs were generated with the 
KymoToolBox Plugin®’. Between 10 and 32 axons were registered and analysed in 
each condition. In all cases, a mitochondrion was considered mobile when it moved 
more than 51m during the time of recording. Distances and speeds of retrograde 
and anterograde transport and dwelling time were measured separately from the 
corresponding kymographs, as previously described*?*, 

(SH]CP55,940 and [*°S]GTP-S binding studies. Cell membrane microarray 
development. The microarrays were composed of a collection of membrane 
homogenates isolated from HEK293 cells transfected with mCherry CB, or 
DN22-CB,, or from hippocampi of CB;+/*(GFP), CB,;~/~(GEP), CB; ~/~(CB,) or 
CB,’ (DN22-CB;) mice (see below), together with increasing amounts of BSA 
and membranes isolated from rat cerebral cortex, as positive internal controls*?. 
Briefly, samples were homogenized using a Teflon-glass grinder (Heidolph RZR 
2020) or a disperser (Ultra-Turrax T10 basic, IKA) in 20 volumes of homogenized 
buffer (1mM EGTA, 3mM MgCh, and 50 mM Tris-HCl pH 7.4) supplemented 
with 250 mM sucrose. The crude homogenate was subjected to a 40g centrifugation 
for cells or 200g for tissue for 5 min, and the resultant supernatant was centrifuged 
again at 18,000g for 15 min (4°C, Microfuge 22R centrifuge, Beckman Coulter). 
The pellet was washed in 20 volumes of homogenized buffer and re-centrifuged 
under the same conditions. The homogenate aliquots were stored at —80°C until 
use. Protein concentration was measured by the Bradford method and adjusted 


to the required concentrations. Microarrays were fabricated by a non-contact 
microarrayer (Nano_plotter NP 2.1) placing the cell membrane homogenates (4nl 
per spot, 3-5 replicates per sample) onto glass slides**. Microarrays were stored 
at —20°C until use. 
[H]CP55,940 binding assays. After thawing, cell membrane microarrays were 
incubated in assay buffer (50 mM Tris-Cl; 1% BSA; pH 7.4) for 30 min at room 
temperature. A second incubation was performed using the same buffer for 
120 min at 37°C in the presence of [7H]CP55,940 (3nM). Non-specific binding 
was determined with 101M WIN55,212-2. Afterwards, microarrays were washed 
twice in buffer, dipped in deionized water and dried. Finally, they were exposed to 
films, developed, scanned and quantified as described below. 
[®S]GTPYS binding studies. [?°S]GTP4S binding studies were carried out 
according to the patented methodology for the screening of molecules that act 
through G-protein-coupled receptors using cell membrane microarrays”. Briefly, 
thawed cell membrane microarrays were dried 20 min at room temperature and 
were subsequently incubated in assay buffer (50 mM Tris-Cl; 1 mM EGTA; 
3mM MgCl; 100 mM NaCl; 0,5% BSA; pH 7.4) for 15 min at room temperature. 
Microarrays were transferred into assay buffer containing 501M GDP and 0.1nM 
[>S]GTP)S, with the cannabinoid agonists WIN55,212-2 or HU210, at increasing 
concentrations, and incubated at 30°C for 30 min. Non-specific binding was 
determined with GTPS (101M). After washing, microarrays, together with 
ARC [!4C]-standards, were exposed to films, developed, scanned and quantified. 
The protein concentration in each spot was measured by the Bradford method 
and used to normalize the [*°S]GTP-S binding results to nCi per ng protein. 
Data from the dose-response curves (5 replicates in triplicate) were analysed 
using the program Prism (GraphPad Software Inc., San Diego, CA) to yield ECs 
(effective concentration 50%) and Emax (maximal effect) of the drugs on each 
different sample by nonlinear regression analysis. Samples displaying [*H] 
CP55,940 binding below the values of hippocampi from CB,~/~ mice were 
excluded from [*°S]GTP\S binding analysis. 
Surgery and drug administration. Mice (7—9 weeks of age) were anaesthetized 
by ip. injection of a mixture of ketamine (100 mg kg '; Imalgene 500, Merial) and 
Xylazine (10 mg kg~!; Rompun, Bayer) and placed into a stereotaxic apparatus 
(David Kopf Instruments) with mouse adaptor and lateral ear bars. 
For intracerebroventricular injections of drugs, mice were unilaterally implanted 
with a 1.0-mm stainless-steel guide cannula targeting the lateral ventricle with the 
following coordinates: anterior-posterior —0.2; lateral + 0.9; dorsal-ventral —2.0. 
For intrahippocampal injections of drugs, mice were bilaterally implanted with 
1.0-mm stainless-steel guide cannulae targeting the hippocampus with the 
following coordinates: anterior—posterior —3.1; medial-lateral + 1.3; dorsal- 
ventral —0.5. Guide cannulae were secured with cement anchored to the skull 
by screws. Mice were allowed to recover for at least one week in individual 
cages before the start of experiments. Mice were weighed daily and individuals 
that failed to return to their pre-surgery body weight were excluded from sub- 
sequent experiments. The intrahippocampal and intracerebroventricular drug 
injections were performed by using injectors protruding 1 mm from the tip of 
the cannula. 
In vivo viral re-expression of CB; and DN22-CB,. For viral intrahippocampal 
AAV delivery, mice were submitted to stereotaxic surgery (as above) and AAV 
vectors were injected with the help of a microsyringe (0.25-ml Hamilton syringe 
with a 30-gauge bevelled needle) attached to a pump (UMP3-1, World Precision 
Instruments). Mice were injected directly into the hippocampus (0.511 per 
injection site at a rate of 0.5, per min), with the following coordinates: dorsal 
hippocampus, anterior—posterior —1.8; medial-lateral + 1; dorsal-ventral —2.0 
and —1.5; ventral hippocampus: anterior—posterior —3.5; medial-lateral + 2.7; 
dorsal-ventral —4 and —3. Following virus delivery, the syringe was left in place 
for 1 min before being slowly withdrawn from the brain. CB; */* mice were injected 
with AAV-GFP to generate CB,‘/'(GFP) mice; CB, ‘~ mice were injected with 
AAV-GEP, AAV-CB, or AAV-DN22-CB,, to obtain CB, ~/~ (GFP), CB; ~/~(CB;) 
and CB,~'~(DN22-CB)) mice, respectively. Animals were used for experiments 4—5 
weeks after injections. Mice were weighed daily and individuals that failed to return 
to their pre-surgery body weight were excluded from subsequent experiments. 
CB,-receptor expression was verified by fluorescent or electromicroscopic 
immunohistochemistry (see below). 
In vivo viral modulation of mitochondrial PKA and NDUFS2 activities. The 
AAV vectors, MLS-PKA-CA or NDUFS2-PM were injected directly into the dorsal 
hippocampus (1.011 per injection site at a rate of 0.511 per min) of C57BL/6N 
mice, with the following coordinates: anterior-posterior — 1.8; medial-lateral + 1; 
dorsal-ventral —2.0 and —1.5. Following virus delivery, the syringe was left in place 
for 1-2 min before being slowly withdrawn from the brain. Animals were used for 
experiments 4—5 weeks after viral delivery. Mice were habituated to i-p. injections 
(saline) before the behavioural paradigm (see below). The hippocampal expression 
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of myc-tagged MLS-PKA-CA and NDUFS2-PM was verified by immunohisto- 
chemistry using anti-myc antibodies. 

Fluorescence immunohistochemistry. Mice were anaesthetized with chloral 
hydrate (400 mg kg! body weight), transcardially perfused with Ringer solution 
(NaCl (135 mM), KCl (5.4mM), MgCl,-6H2O (1mM), CaCls-2H20 (1.8mM), 
HEPES (5 mM)). Heparin choay (25,000 UI per 5 ml) was added extemporarily 
and tissues were then fixed with 500 ml of 4% formaldehyde dissolved in PBS 
(0.1 M, pH 7.4) and prepared at 4°C. After perfusion, the brains were removed and 
incubated several additional hours in the same fixative. Serial vibrosections were 
cut at 40-50-|1m thickness and collected in PBS at room temperature. Sections 
were pre-incubated in a blocking solution of 10% donkey serum, 0.1% sodium 
azide and 0.3% triton X-100 prepared in PBS for 30 min-1h at room temperature. 
Free-floating sections were incubated for 48 h (4°C) with goat anti-CB, polyclonal 
antibodies raised against a C-terminal sequence of 31 amino acids (NM007726) of 
the mouse CB, receptor (CB;-Go-Af450-1; 21g ml” |; Frontier Science Co. Ltd) or 
overnight (4°C) with rabbit anti-myc (Ozyme; 1:1,000). The antibody was prepared 
in 10% donkey serum in PBS containing 0.1% sodium azide and 0.5% triton X-100. 
Then, the sections were washed in PBS for 30 min at room temperature. The tissue 
was subsequently incubated with fluorescent anti-goat Alexa488 (1:200, Jackson 
ImmunoResearch) for 4h and washed in PBS at room temperature, before being 
incubated with DAPI (1:20,000) for 10 min for nuclear counterstaining. Finally, 
sections were washed, mounted, dried and a coverslip was added on top with 
DPX (Fluka Chemie AG). The slides were analysed with an epifluorescence Leica 
DM6000 microscope (Leica). 

Immuno-electron microscopy. CB;*/*(GFP), CB; /~ (GFP), CB; /~(CB,) 
and CB,~'~(DN22-CB}) mice (n =3 per group) were processed for electron 
microscope pre-embedding immunogold labelling as previously described”. 
Immunodetection was performed in 50-j1m-thick sections of hippocampus with 
goat anti-CB, polyclonal antibodies raised against a 31 amino acid C-terminal 
sequence (NM007726) of the mouse CB, receptor (Frontier Institute Co. Ltd, 
CB,-Go-Af450-1; 2pg ml~!). 

Semiquantification of mtCB,. Immunogold particles were identified and counted. 
To exclude the risk of counting possible false positive mitochondrial labelling, we 
used strict semi-quantification methods of mtCB, receptors as recently described, 
excluding immunogold particles that were located on mitochondrial membranes 
but at a distance <80nm from other cellular structures®. The normalized number 
of immunogold particles located on mitochondria versus the total amount of 
immunogold particles in each field was used to calculate the proportion of mtCB; 
receptors over total CB). 

Field excitatory post-synaptic potentials ((EPSP) recording. Mice were anaes- 
thetized with isoflurane and killed by decapitation. Brains were rapidly removed 
and chilled in an ice-cold, carbonated (bubbled with 95% O,-5% CO2) cutting 
solution containing 180 mM sucrose, 2.5mM KCl, 0.2mM CaCh, 12mM MgCh, 
1.25mM NaH»POg,, 26mM NaHCO; and 11 mM glucose (pH 7.4). Sagittal hip- 
pocampal slices (350-j1m thick) were cut using a Leica VT1200S vibratome and 
incubated with artificial cerebrospinal fluid (ACSF) containing 123 mM NaCl, 
1.25mM NaH>P04, 11 mM glucose, 2.5mM KCl, 2.5mM CaCh, 1.3mM MgCl, 
and 26mM NaHCO; (osmolarity of 298 + 7; pH 7.4) for 30 min at 34°C. The slices 
were subsequently transferred to a holding chamber, where they were maintained 
at room temperature until experiments. Slices were individually transferred to a 
submerged chamber for recording and continuously perfused with oxygenated 
(95% O2-5% CO2) ACSF (3-5 ml min™!). All experiments were performed at room 
temperature. fEPSPs were recorded using glass micropipettes (2-4 mQ) filled with 
normal ACSF positioned in the CA1 hippocampal region. Slices from the middle 
hippocampus were used preferentially. fEPSPs responses were evoked by stimula- 
tion (0.1-ms duration, 10-30-V amplitude) delivered to the stratum radiatum to 
stimulate the Schaffer collateral fibres using similar glass electrodes used for the 
recordings, in the presence of picrotoxin 100\1M. Recordings were obtained using 
an Axon Multiclamp 700B amplifier (Molecular Devices). Signals were filtered at 
2kHz, digitized, sampled and analysed using Axon Clampfit software (Molecular 
Devices). In CB,~/~(CB,) and CB,~/~(DN22-CB)), two slices (1 each) were 
excluded from analysis, because immunohistochemistry showed no re-expression. 
Novel object-recognition memory task. To study the effect of mtCB; receptor 
signalling on cannabinoid-induced amnesia, we used the hippocampal-dependent 
NOR memory task in an L-maze (L-M/NOR)'**78, As compared to other 
hippocampal-dependent memory tasks, this test presents several advantages for 
the aims of the present study: (i) the acquisition of L-M/NOR occurs in one step 
and previous studies revealed that the consolidation of this type of memory is 
deeply altered by acute immediate post-training administration of cannabinoids 
via hippocampal CB, receptors!***; (ii) this test allows repeated independent 
measurements of memory performance in individual animals"’, thereby allowing 
within-subject comparisons, eventually excluding potential individual differences 
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in viral infection and/or expression of proteins; (iii) notably, CB; -/~ mice do 
not respond to the administration of cannabinoids, but they do not show any 
spontaneous impairment of performance in L-M/NOR", thereby allowing the use 
of re-expression approaches to study the role of hippocampal mtCB, receptors in 
the cannabinoid-induced blockade of memory consolidation. 

This task was performed with an L-maze made out of dark-grey Plexiglas 

with two corridors (35cm and 30cm long, respectively, for external and internal 
V walls, 4.5 cm wide and 15-cm high walls) set at a 90° angle and under a weak light 
intensity (50 Lux). The task consisted of 3 sequential daily trials of 9 min. Day 1 
(habituation): mice were placed at the intersection of the two arms and were let free 
to explore the maze. Day 2 (acquisition): two identical objects were placed at the 
end of each arm. After 9 min of exploration, mice were removed and injected. Day 3 
(retrieval): A novel object different in its shape, colour and texture was placed at 
the end of one of the arm, whereas the familiar object remained at the end of the 
other arm. The position of the novel object and the pairings of novel and familiar 
objects were randomized. Exploration of each object was scored off-line by at least 
two experienced observers blind to treatments and/or genotypes. Exploration was 
defined as the time spent by the mouse with the nose pointing to the object at a 
distance of less than 1 cm, whereas climbing on or chewing the object was not 
considered as exploration'*. Memory performance was assessed by the discrimi- 
nation index. The discrimination index was calculated as the difference between 
the time spent exploring the novel (TN) and the familiar object (TF) divided by 
the total exploration time (TN+TF): discrimination index = (TN—TF)/(TN+TE). 
Mice receiving the acute intrahippocampal infusion of KH7 (10 mM) and WIN 
(5mg kg!) ip., and mice that received intrahippocampal injection with AAV- 
MLS-PKA-CA or AAV-NDUFS2-PM were submitted to a single L-M/NOR 
session. Due to the limited numbers of available mice, null CB,;~/~ mice virally 
injected with AAV-GFP, AAV-CB, or AAV-DN22-CB, were tested twice with 
a one-week interval using different pairs of objects and treated the first time 
with vehicle and the following with WIN. Every pair of objects was previously 
screened to exclude that the animals might exhibit significant preference for any 
specific item. After the NOR task, the hippocampi of vehicle-treated AAV-MLS- 
PKA-CA and AAV-NDUFS2-PM animals were dissected and used for respiration 
experiments (see above). Expression of the myc epitope was verified by immuno- 
histochemistry in the hippocampi of animals treated with WIN. 
Statistical analyses. All graphs and statistical analyses were performed using 
GraphPad software (version 5.0 or 6.0). Results were expressed as means of 
independent data points + s.e.m. For biochemical quantifications (CAMP levels, 
PKA and complex-] activities and oxygen consumption), data are presented as 
percentage of controls with or without the application of cannabinoid drugs. With 
the exception of KH7 (see Extended Data Fig. 5a), no other drugs and plasmids 
had any effect per se on any measured parameter (not shown). Data were analysed 
using paired or unpaired Student's t-test, one-way (followed by Tukey's post hoc 
test) or two-way ANOVA (followed by Bonferroni's post hoc test), as appropriate. 
Detailed statistical data for each experiment are reported in Supplementary 
Tables 1-3. 
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Extended Data Figure 1 | The mutant DN22-CB, receptor is 

functional, but it does not mediate cannabinoid-induced alterations 

of mitochondrial activity and mobility. a, Acute treatment with HU210 
(100 nM) induces changes in cellular oxygen consumption in intact 
primary fibroblasts (MFs) from CB,-'~ mice electroporated with a 

CB -expressing plasmid, whereas no alterations were observed in DN22- 
CB,-expressing or mock-electroporated (mCherry) CB,~/~ mouse 
fibroblasts (n = 6-10). b, HU210 (200 nM) treatment decreases HEK293 
cellular respiration in cells expressing CB,, but not mCherry or DN22-CB, 
receptors (n = 4). c, Representative immunoblotting showing the effect of 
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HU210 (100 nM) on pERK and ERK levels in HEK293 cells transfected 
with plasmids expressing mCherry, CB, or DN22-CB, (n=7-13). 
Quantitative data are shown in Fig. 1b of main text. d, Representative 
confocal images (green, GFP; red, pDsred2—mito) and kymographs 
corresponding to the results shown in Fig. 1d; showing how HU210 
(14M) reduces mitochondrial mobility in the primary neurons from 
CB,~'~ mice transfected with vectors expressing CB), but not GFP or 
DN22-CB). Scale bar, 101M. Data, mean +s.e.m.; *P < 0.05. For statistics, 
see Supplementary Tables 1-3. 
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Extended Data Figure 2 | Characterization of CB,*/* (GFP), 
CB,~'~(GEP), CB,~'~(CB,) and CB,~/~ (DN22-CB,) mice. 

a, Representative immunofluorescence micrographs showing CB,- 
receptor expression in hippocampal brain sections from CB,*/+(GFP), 
CB, -'~ (GFP), CB;~'~(CB,) and CB,;~'~(DN22-CB) mice, respectively. 
b, H]CP55,940 CB,-receptor binding experiments, showing similar 
levels of CB, protein expression in hippocampal extracts from 
CB,/~(CB,) and CB,‘ (DN22-CB,) mice, both significantly higher 
than control CB;~!~ (GFP) (n =7-24). ¢, Dose-response curves of 
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[°°S]GTP*S binding in membranes isolated from hippocampi of 

CB,~/~ (GFP), CB, -/~(CB,) and CB,;~'~ (DN22-CB,) mice treated with 
WIN (n= 9-24). d, e, CB)-receptor-dependent activation of G proteins is 
similar in hippocampi from CB,~/~(CB,) and CB,~'~(DN22-CB,) mice, 
as shown by ECs» dose and maximal activation values calculated from 
[?°S]GTP*S binding experiments after treatment with HU210 (d) or WIN 
(e). f, Total exploration time in NOR experiments as described in Fig. 1i. 


Data, mean 4 


t s.e.m.; ***P < 0.001; NS, not significant. For statistics, 


see Supplementary Tables 1-3. 
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Extended Data Figure 3 | Activation of mtCB, receptor affects 
mitochondrial activity via G,j/, signalling. a, THC (1 1M) or WIN 
(14M) decrease cellular respiration of primary hippocampal cultures 
derived from CB,*!* but not from CB,~/~ mice (n= 5-15). b, THC (11M) 
or WIN (11M) reduce both total and mitochondrial ATP levels of primary 
hippocampal cultures derived from CB,;*/* but not from CB;~/~ mice 
(n= 6-14). c, Quantification of data presented in Fig. 2b, showing the 
release of the G protein upon activation of mtCB, receptors (n = 3). 

d, e, Effects of pertussis toxin (PTX, 1 pg ml!) and THC (800 nM) on 
cAMP levels in purified brain mitochondria obtained from CB,*/+ mice 
(d) and CB,~‘~ littermates (e) (n= 4). f, g, Effects of PTX (1 jug ml~') and 


THC (800 nM) on PKA activity in purified brain mitochondria 

obtained from CB,*/+ mice (f) and CB,~/~ littermates (g) (n = 5-6). 

h, PTX (1g ml~') blocks the effect of THC (800 nM) and WIN (100 nM) 
on complex-I activity of purified brain mitochondria obtained from 
CB,*/* mice (n= 3-6). i, Representative trypsin-sensitivity assay 

(3 independent experiments) showing the intra-mitochondrial localization 
of G,, sAC and PKA. NDUFA9, a subunit of complex I; TOM20, 
translocase of outer membrane subunit 20. Data, mean +s.e.m.; *P < 0.05; 
**P < 0.01; ***P < 0.001; NS, not significant. For statistics, 

see Supplementary Tables 1-3. 
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indicating higher interaction between G, proteins and sAC after mtCB, cells. SDHA, succinate dehydrogenase A (subunit of complex II); TOM20, 
activation (n= 3). b, The transmembrane adenylyl cyclase activator translocase of outer membrane subunit 20. g, Representative trypsin- 


forskolin (10,1M) does not alter basal levels and does not reverse the effect sensitivity assay (3 independent experiments) showing that sAC-HA and 
of THC (800 nM) on cAMP content of CB;*!* brain mitochondria (n = 4). mtsAC-HA are localized inside mitochondria in transfected 3T3 cells. 

c, Forskolin (101M) has no effect on basal cAMP levels of CB;~/~ brain h, Representative immunoblot (3 independent experiments) showing 
mitochondria (n = 3-4). d, The sAC activator HCO; (5 mM) reverses the the expression of sAC-HA in transfected and non-transfected primary 
effect of THC (800 nM) on cAMP content in purified brain mitochondria hippocampal cultures. i, Expression of sAC-HA in primary hippocampal 


from CB,*!*+ mice (n= 4). e, HCO;~ (5mM) and THC (800 nM) have cultures blocks the effect of THC (11M) on basal cellular respiration 
no effect on cAMP levels (n = 3-4) of CB,~/~ brain mitochondria. (n= 8-10). Data, mean +s.e.m.; P< 0.05; **P< 0.01; ***P < 0.001; 
f, Representative trypsin sensitivity assay in mitochondrial fractions of NS, not significant. For statistics, see Supplementary Tables 1-3. 
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Extended Data Figure 5 | The sAC inhibitor KH7 prevents the effects 
of mtCB, activation. a, The effect of KH7 (51M) on mitochondrial 
respiration in purified brain mitochondria obtained from CB,*/* mice 
(n= 5-6). b, KH7 (54M) blocks the effect of THC (800 nM) on PKA 
activity in purified brain mitochondria obtained from CB;*/* mice 

(n= 5-11). c, KH7 (51M) blocks the effect of THC (800 nM) and WIN 
(100 nM) on complex-I activity in purified brain mitochondria obtained 
from CB,*'+ mice (n = 3-6). d, Representative confocal images (green, 
CB,-GFP; red, pDsred2-mito) of analysed axonal tracts and kymographs 
of the results shown in Fig. 3b, representing the effect of KH7 (541M) on 
the reduction in mitochondrial mobility in primary hippocampal neurons 
induced by HU210 (11M). Scale bar, 101M. e, mtsAC-HA blocks the 
decrease in mitochondrial mobility in CB|-expressing neurons induced by 


HU210 (11M) (n=7-18). f, The effect of KH7 (10|1M) on the decrease 
in fEPSP at CA3-CA1 synapses from C57BL/6N mice induced by WIN 
(541M). Left, plots of normalized fEPSP slopes in vitro with representative 
fEPSP traces before (1) and after (2) WIN incubation. Vehicle (n = 6) or 
KH7 (n=5) were pre-incubated 10 min before cannabinoid application. 
Right, histogram summarizing the average changes in the percentage 

of the fEPSP slope before (100% baseline, dotted line) and after WIN 
treatment in the presence or absence of KH7. g, KH7 (101M) alone does 
not affect the fEPSP slope at CA3-CA1 synapses from C57BL/6N mice. 
Plots of normalized fEPSP slopes in vitro before and after KH7 application 
(n=5).h, Total exploration time in NOR experiments as described in 
Fig. 3d. Data, mean + s.e.m.; *P < 0.05; ***P < 0.001; NS, not significant. 
For statistics, see Supplementary Tables 1-3. 
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Extended Data Figure 6 | Mitochondrial PKA activity mediates the 
effects of mtCB, receptors. a, The PKA inhibitor H89 (800 nM) blocks 
the effect of THC (800 nM) and WIN (100 nM) on complex-I activity 

in CB,*'* purified brain mitochondria (n = 3-6). b, H89 blocks the 

effect of THC on PKA activity in CB;*/* purified brain mitochondria 
(n=6). c, H89 and THC have no effect on PKA activity in CB,~/~ brain 
mitochondria (n = 3-5). d, Quantification of data presented in Fig. 4c 
(n= 3), indicating a lower mitochondrial PKA content after silencing of 
AKAP121.e, f, Silencing of AKAP121 lowers mitochondrial PKA activity. 
e, Representative immunoblots of proteins phosphorylated by PKA (PKA- 
dependent phosphorylation) in 3T3 cells transfected with control vector or 


shAKAKP121. SHDA, succinate dehydrogenase A (a subunit of complex 
11). f, Quantification of data presented in e (n = 12). g-i, Silencing of 
AKAP121 blocks the effect of THC on cellular respiration. g, Silencing of 
AKAP121 blocks the effect of THC (800 nM) on basal cellular respiration 
in 3T3 cells (n=4). h, Representative immunoblotting of AKAP121 
expression in extracts of hippocampal primary cultures in the presence or 
absence of shAKAP121. i, shAKAP121 inhibits the effect of THC on basal 
cellular respiration of primary hippocampal cultures (n = 5-10). Data, 
mean +s.e.m.; *P < 0.05; **P< 0.01; ***P< 0.001; NS, not significant. 
For statistics, see Supplementary Tables 1-3. 
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Extended Data Figure 7 | Activation of mtCB, decreases PKA- 
dependent phosphorylation of the complex-I subunit NDUFS2, 
eventually mediating the effects on respiration. a, THC specifically 
decreases complex-I- but not complex-II- or complex-IV-dependent 
respiration in purified brain mitochondria in a CB,-dependent manner 
(n= 3-4). b, Representative two-dimension (BN-PAGE/SDS-PAGE) 
immunoblotting (4 independent experiments) showing PKA-dependent 
phosphorylation of protein extracts of brain mitochondria from CB,;*/* 
and CB,~'~ mice treated with vehicle or THC (800 nM). ¢, Representative 
immunofluorescence of HeLa cells transfected with empty vector (mock), 
the mitochondrial-targeted constitutively active PKA (MLS-PKA-CA) 


or the phosphomimetic mutant of NDUFS2 (NDUFS2-PM). Green, myc 
tag staining; red, mitochondrial staining (TOM20). d, Baseline cellular 
respiration of HEK293 cells transfected with plasmids expressing CB), 
CB, + MLS-PKA-CA or CB; + NDUFS2-PM (n= 4-6). e, The effect of 
vehicle or WIN on cellular respiration in HEK293 cells as in d (n= 4-6). 
f, Representative images of hippocampi injected with AAV-GFP, 
AAV-MLS-PKA-CA or AAV-NDUEFS2-PM. Green, GFP fluorescence 
(left panel) or myc tag staining (central and right panel). Arrows, subunit 
of complex I, NDUFS2. ***P < 0.001 between genotypes (a) or between 
vehicle and WIN (d); **P < 0.01; ***P < 0.001 as compared to control. For 
statistics details, see Supplementary Tables 1-3. 
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Extended Data Table 1 | Theoretical in silico predictions of mitochondrial and plasma membrane localization of CB; and DN22-CB, 
receptor proteins, based on the programs MitoProt and PSORT 


Probability of Targeting 


Amino acid sequence 


MKSILDGLADTTFRTITTDLLYV GSNDIQYEDIKGDMASKLGYFPQKFPLTSFR 
GSPFQEKMTAGDNSPLVPAGDTTNITEFYNKSLSSFKENEDNIQCGENFMD 
MECFMILNPSQOLAIAVLSLTLGTFTVLENLLVLCVILHSRSLRCRPSYHFIGSLA 
VADLLGSVIFVYSFVDFHVFHRKDSPNVFLFKLGGVTASFTASVGSLFLTAIDRY 
ISIHRPLAYKRIVTRPKAVVAFCLMWTIAIVIAVLPLLGWNCKKLOSVCSDIFPLI 
DETYLMFWIGVTSVLLLFIVYAYMYILWKAHSHAVRMIQRGTOKSIINHTSEDG 
KVQVTRPDOQARMDIRLAKTLVLILVVLIICWGPLLAIMVY DVFGKMNELIKTVF 
AFCSMLCLLNSTVNPIYALRSKDLRHAFRSMFPSCEGTAQPLONSMGDSDCL 
HKHANNTASMHRAAESCIKSTVKIAKVTMSVSTDTSAEL 


MGSNDIQYEDIKGDMASKLGYFPQKFPLTSFRGSPFQEKMTAGDNSPLVPA 
GOTTNITEFYNKSLSSFKENEDNIQCGENFMDMECF MILNPSQQLAMIAVLSLT 
LGTFTVLENLLVLCVILHSRSLRCRPSYHFIGSLAVADLLGSVIFVYSFYDFHVFH 
RKDSPNVFLFKLGGVTASFTASVGSLFLTAIDRYISIHRPLAYKRIVTRPKAVVAF 
CLMWTIAIVIAVLPLLGWNCKKLOSVCSDIFPLIDETYLMFWIGVTSVLLLFIVY 
AYMYILWKAHSHAVRMIQRGTOKSINHTSEDGKVOQVTRPDOARMDIRLAKT 
LVUILVVLIICWGPLLAIMVY DVFGKMNKLKTVFAFCSMLCLLNSTVNPIIYALR 
SKDLRHAFRSMFPSCEGTAQPLONSMGDSDCLHKHANNTASMHRAAESCIK 
STVKIAKVTMSVSTDTSAEL 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature20168 


Designer matrices for intestinal stem cell and 


organoid culture 


Nikolce Gjorevski!, Norman Sachs’, Andrea Manfrin!, Sonja Giger', Maiia E. Bragina!, Paloma Ord6fiez-Moran?, 


Hans Clevers? & Matthias P. Lutolf* 


Epithelial organoids recapitulate multiple aspects of real organs, 
making them promising models of organ development, function 
and disease!~?. However, the full potential of organoids in research 
and therapy has remained unrealized, owing to the poorly defined 
animal-derived matrices in which they are grown*. Here we 
used modular synthetic hydrogel networks™® to define the key 
extracellular matrix (ECM) parameters that govern intestinal stem 
cell (ISC) expansion and organoid formation, and show that separate 
stages of the process require different mechanical environments and 
ECM components. In particular, fibronectin-based adhesion was 
sufficient for ISC survival and proliferation. High matrix stiffness 
significantly enhanced ISC expansion through a yes-associated 
protein 1 (YAP)-dependent mechanism. ISC differentiation and 
organoid formation, on the other hand, required a soft matrix and 
laminin-based adhesion. We used these insights to build a fully 
defined culture system for the expansion of mouse and human 
ISCs. We also produced mechanically dynamic matrices that were 
initially optimal for ISC expansion and subsequently permissive 
to differentiation and intestinal organoid formation, thus creating 
well-defined alternatives to animal-derived matrices for the culture 
of mouse and human stem-cell-derived organoids. Our approach 
overcomes multiple limitations of current organoid cultures and 
greatly expands their applicability in basic and clinical research. 
The principles presented here can be extended to identify designer 
matrices that are optimal for long-term culture of other types of 
stem cells and organoids. 

Organoids formed by self-organizing stem cells resemble their 
native counterparts in cellular content, multicellular architecture and 
functional features. As such, they are emerging as powerful tools in 
basic and translational research. Intestinal organoids possess a par- 
ticularly high level of multicellular organization and a wide range 
of potential applications’. When cultured under the appropriate 
3D conditions, single Lgr5-expressing ISCs undergo cycles of self- 
renewal, differentiation and morphogenesis, and self-organize into 
crypt-villus domains that house both cycling ISCs and differentiated 
intestinal cells*. These organoids hold promise as models of the ISC 
niche, intestinal development and function’, as models of patient-specific 
epithelial diseases’, and as sources of tissue for autologous transplants'®. 

Current organoid culture methods are complicated by the nearly 
exclusive dependence on animal-derived hydrogels, including 
Matrigel!’ and collagen’, as the 3D matrix. These matrices feature 
complex and variable compositions'*"“, are not conducive to controlled 
modifications and pose risks of immunogen and pathogen transfer, 
which makes them unsuitable for organoid expansion with downstream 
clinical applications. Synthetic ECM analogues are promising alterna- 
tives to native matrices™!°. Biochemical and biophysical components 
can be varied, and their influence on biological processes can be tested 
systematically, allowing for the identification of cellular-system-specific 


3D culture environments. Here we used synthetic ECMs to define the 
microenvironmental parameters that govern ISC behaviour and intes- 
tinal organoid formation. We used this insight to create a well-defined 
3D matrix for the culture of ISCs and intestinal organoids. 

To assess whether a 3D gel matrix is sufficient for ISC expansion 
and organoid formation, we embedded single dissociated mouse ISCs 
into enzymatically crosslinked polyethylene glycol (PEG) hydrogels'®, 
which are soft and hydrated, thus mimicking the basic physical proper- 
ties of Matrigel without contributing any biochemical signals. Whereas 
cells cultured in Matrigel divided to form lumen-containing colonies 
(Extended Data Fig. 1a), those embedded in soft (~300 Pa) PEG gels 
did not (Fig. 1a), suggesting that not only the physical support in 3D, 
but also biochemical signals produced by the matrix are required for 
ISC survival and proliferation. Indeed, enrichment of our inert PEG 
gels with fibronectin, laminin-111, collagen IV, hyaluronic acid and 
perlecan, key ECM components located at the bottom of the crypts 
in vivol’?}, enhanced ISC survival and proliferation (Fig. 1a, b). 

Aiming to create the most minimal, chemically defined environ- 
ment, we replaced full-length fibronectin with an RGD (Arg-Gly-Asp) 
peptide. Like fibronectin, RGD stimulated ISC colony formation in a 
concentration-dependent fashion (Fig. 1c, d). Freshly isolated mouse 
intestinal crypts embedded in RGD-functionalized PEG gels (PEG 
RGD) likewise survived and formed expanding colonies (Extended 
Data Fig. 1b). Cells maintained in PEG RGD preserved their colony 
formation efficiency and Lgr5 expression after at least four passages, 
indicating that single ISCs can be successfully expanded within PEG 
RGD gels (Fig. le, f) without losing their organoid formation capacity, 
once re-embedded in Matrigel (Fig. 1g, h). Quantitative reverse tran- 
scription polymerase chain reaction (qRT-PCR) confirmed that, under 
ISC expansion conditions, Lgr5 was highly expressed in both PEG RGD 
and Matrigel (Extended Data Fig. 1c). As expected, markers of dif- 
ferentiated intestinal cell types became upregulated under organoid 
formation conditions. Notably, lysozyme appeared to be significantly 
expressed in Matrigel-based ISC expansion culture, indicating that 
colonies expanded in Matrigel contain a population of differentiated 
Paneth cells, which were notably absent in the PEG gels (Extended Data 
Fig. 1d). Hence, minimal PEG-based matrices may be more suitable for 
the maintenance of a pure Lgr5* ISC population. 

Embryonic and adult stem cells, including ISCs, are influenced by 
their mechanical environment”””?. We increased our understanding of 
the mechanical regulation of ISCs by considering the effects of matrix 
stiffness, which have not been examined before, as performing con- 
trolled mechanical modulations in vivo and in Matrigel is challeng- 
ing. In the presence of 1 mM RGD, matrix stiffness indeed influenced 
ISC colony formation (Fig. 2a, b, Extended Data Fig. le): ISC expan- 
sion was optimal within matrices of intermediate stiffness (1.3 kPa), 
whereas stem cells embedded in soft matrices (300 Pa) proliferated 
poorly. ISC expansion in stiff matrices required cytoskeletal tension 


lLaboratory of Stem Cell Bioengineering, Institute of Bioengineering, School of Life Sciences (SV) and School of Engineering (STI), Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, 
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Figure 1 | Building a synthetic ISC niche. a, Mouse ISCs embedded 

in plain PEG and PEG functionalized with the indicated ECM 
components. b, Quantification of colony formation efficiency. HA, 
hyaluronic acid. ¢, d, Effect of RGD concentration on ISC colony 
formation in PEG (c) and quantification (d). e, f, ISC colony formation in 
Matrigel and PEG RGD (e) and quantification (f). g, Organoids formed 
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from ISC colonies expanded in Matrigel and PEG RGD. Asterisks indicate 
autofluorescence. Arrows indicate Lgr5-expressing ISCs. h, Organoids 
formed from ISC colonies expanded in PEG RGD and then transplanted 
into Matrigel contain differentiated intestinal cells. Graphs show 
individual data points derived from n = 3 independent experiments and 
means. *P < 0.05; **P< 0.01; ***P< 0.01. Scale bars, 501m. 
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Figure 2 | Matrix mechanical properties control ISC proliferation. 

a, ISC colony formation in PEG RGD hydrogels of varying stiffness. 

b, Effect of matrix stiffness, cellular contractility and adhesion on ISC 
colony formation. Graphs show individual data points derived from n = 3 
independent experiments and means. ¢, d, Distribution of YAP in day 

1 ISC colonies (c) and quantification (d). n = 27 colonies (300 kPa) and 
n=21 (1.3kPa). Data are represented as mean +s.e.m. e, f, Distribution of 


YAP in day 1 colonies formed in 1.3 kPa matrices treated with DMSO or 
blebbistatin (e) and quantification (f). n = 27 colonies (DMSO) and n= 30 
(blebbistatin). Data are represented as mean + s.e.m. g, Effect of YAP 
knockdown on ISC colony formation (g) and quantification (h). Graphs 
show individual data points derived from n = 3 independent experiments 
and means. *P < 0.05; **P < 0.01; ***P < 0.01. Scale bars, 501m. 
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Figure 3 | ISC proteolytic activity influences proliferation, morphology 
and fate. a, Colony formation efficiency of ISCs embedded in degradable 
(DG) or non-degradable (N-DG) PEG gels of varying stiffness. b, c, The 
effect of matrix degradability on ISC colony shape (b) and quantification 
(c). Data are represented as mean + s.d. Number of colonies analysed was 
13 (soft PEG DG), 25 (stiff PEG DG), 5 (soft PEG N-DG), 5 (stiff PEG 
N-DG), 7 (soft PEG DG + GM6001) and 8 (stiff PEG DG + GM6001). 

d, Colonies in PEG DG and PEG N-DG, stained for F-actin and 


and integrin-based adhesion, as pharmacological blocking of cell con- 
tractility and integrin binding reduced colony formation (Fig. 2b). 

YAP, an effector of the Hippo signalling pathway, is crucial for ISC 
self-renewal and expansion during intestinal tissue regeneration and 
organoid formation in vitro~*”>. In addition, YAP has been implicated 
in cellular mechanosensing and mechanotransduction, displaying 
enhanced nuclear translocation and activation in response to mechanical 
tension*®”’, To test whether YAP mediates the mechanical dependence 
of ISC expansion, we assessed its subcellular localization within 
colonies grown in soft matrices compared to stiff matrices. Whereas 
colonies in soft matrices featured predominantly cytoplasmic local- 
ization, YAP was highly nuclear within cells grown in stiff matrices 
(Fig. 2c, d). Pharmacological dissipation of mechanical tension led to a 
significant reduction in the nuclear translocation of YAP in stiff matri- 
ces (Fig. 2e, f). Likewise, controlled matrix softening (Extended Data 
Fig. 2a) resulted in a change in colony morphology (Extended Data 
Fig. 2b), cytoplasmic translocation of YAP within 4-8 h (Extended 
Data Fig. 2c, d), and growth arrest (Extended Data Fig. 2e). Depletion 
of YAP, using lentiviral delivery of short hairpin RNA (shRNA) 
(Extended Data Fig. 2f), led to a marked reduction in ISC colony for- 
mation within PEG hydrogels (Fig. 2g, h), as did blocking YAP activity 
with the small molecule verteporfin (Extended Data Fig. 2g), which 
suggests that YAP activity is required for ISC expansion within PEG 
matrices. Collectively, these data suggest that matrix stiffness controls 
ISC expansion by influencing YAP activity. 

Noting that proteolytic remodelling is important for morphogenesis 
and repair”®, we reasoned that making the PEG RGD gels suscepti- 
ble to degradation by matrix metalloproteinases would render them 
more physiological, and potentially enhance ISC growth. Therefore, 
a mutated collagen-I-derived sequence'®, which can be rapidly 
cleaved by cell-secreted matrix metalloproteinases, was incorporated 
in the backbone of the polymer network. Although conferring deg- 
radability enhanced ISC colony formation in stiff matrices (Fig. 3a), 
the resulting colonies adopted irregular shapes (Fig. 3b, c), were 
depolarized and disorganized (Fig. 3d, e), and had attenuated Lgr5 
expression (Fig. 3f, g) and organoid formation capacity (Fig. 3h, i). 
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E-cadherin. e, Quantification of the percentage of columnar and polarized 
colonies. f, g, Lgrs5-eGFP expression within colonies cultured in PEG 

DG and PEG N-DG (f) and quantification (g). h, i, Organoid formation 
from ISC colonies expanded in PEG DG or PEG N-DG and transferred 

to Matrigel (h) and quantification (i). Individual data points derived from 
n= 3 independent experiments and means are shown (a, e, g, i). *P < 0.05; 
** P< 0.01; ***P < 0.01. Scale bars, 50m. 


RNA sequencing and gene set enrichment analysis revealed significant 
upregulation of stress and inflammation signalling-related genes within 
degradable matrices (Extended Data Fig. 3, Extended Data Tables 2 
and 3; GEO, GSE85391). Hence, by enabling ISCs cultured within a 
stiff environment to engage in rapid matrix degradation, we modelled 
a pathological, inflammation-like state, that appears to be deleterious 
to stem cell maintenance. 

Next, to test whether PEG RGD gels optimized for ISC expansion 
also support differentiation and organoid formation, we switched 
expanded colonies derived from Lgr5-eGFP-ires—CreERT2 mouse 
intestinal crypts to differentiation conditions, which led to global loss of 
Lgr5-eGFP and colony destruction in both degradable and non-degra- 
dable PEG RGD gels (Extended Data Fig. 4a). To shed light on the defi- 
ciencies blocking organoid formation within PEG RGD, we monitored 
differentiating ISC colonies in both PEG RGD and Matrigel by time- 
lapse imaging. In Matrigel, we observed attenuation in Lgr5 expression 
within 24h (Extended Data Fig. 4b). Faint Lgr5-eGFP expression was 
detected at punctate locations, which formed buds by 48h; the Lgr5- 
eGFP signal was then recovered in the newly formed buds. In synthetic 
matrices, the initial steps paralleled those in Matrigel. However, instead 
of forming external buds, the Lgr5-eGFP epithelial patches thickened 
and buckled (Extended Data Fig. 4b). Comparing the temporal profiles 
of YAP localization within PEG and Matrigel, we found that, within stiff 
matrices, the proportion of cells with nuclear YAP sharply dropped over 
time (Extended Data Fig. 4c). In contrast, colonies cultured in Matrigel 
had sustained, intermediate levels of nuclear YAP (Extended Data 
Fig. 4c). We proposed that, although stiff matrices initially enhance 
YAP activity within ISCs by promoting cellular tension”, the result- 
ing colony growth within a stiff confining environment may give rise 
to compression, which would lead to YAP inactivation, thus retarding 
further growth and morphogenesis”>”’. 

We therefore sought to design mechanically dynamic matrices, which 
would afford the higher stiffness initially required for YAP activation 
and ISC expansion, but would subsequently soften to alleviate the 
accumulation of compressive forces, and permit in vitro organogenesis. 
Thus, we created hybrid PEG hydrogels wherein a stable polymer 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


= ae -O- ‘Si 
a see eohis 100% sPEG c 
e a * eee 100% sPEG 5090PEG 
8 ‘ 8 \ Ae 75% dPEG we : 
vaNs oe eal a 2,000 ~O- 100% dPEG 
> » 9 ! ae: f j 
= , ’ 
g& Se as 0 84 O : 
se rag — an 
5a 1,5004 4 
ects eons = ° - 
ia _ a“ n & so — . — 
0 : a Oe eG 1,000 5 75% dPEG } ¢ pecoeeres 
tao A St i op ¥ 2% See | 
23 26° i SR HON Ss : S00 ‘¢ yer 
= . ’ J 
So fegete.___-- Sie ae? 
a§e O80! incincun an 
o£ \ 4 
= = e@ eo” Ime in culture 04 


sPEG @ dPEG 


OA Yo om 
SP Pe @ 


Organoids formed in artificial matrices 


E-cadherin DAPI 


e ISC expansion 


d 


Organoids formation 


Non-degradable 


1. Stiff matrix stiff matrix 


2. RGD =~ {9 = _ 
Degradable 
stiff matrix 


Figure 4 | Intestinal organoid formation within well-defined matrices. 
a, A schematic illustrating the chemical basis of the mechanically dynamic 
hydrogels. b, Mechanical characterization of hybrid sPEG-dPEG gels. For 
each condition, graph shows individual data points derived from n= 3 
independently prepared gels and means. G', shear modulus. c, Organoid 


backbone (sPEG, mechanically static PEG) was partially replaced 
with hydrolytically degradable polymer (dPEG, mechanically dynamic 
PEG)”, the presence of which affords sustained gel softening (Fig. 4a). 
Varying the ratio of sPEG to dPEG within the gels allowed us to con- 
trol the softening profile over time (Fig. 4b). ISC colony formation in 
all conditions was extensive. Upon switching the colonies to organoid 
formation conditions, we found that matrix softening was not sufficient 
to induce budding in the presence of RGD only (data not shown). In 
the presence of RGD and laminin-111, but not laminin-derived pep- 
tides (Extended Data Fig. 5i), the softening profile of the gels signifi- 
cantly influenced the emergent phenotype (Fig. 4c): organoids formed 
only in gels with a stiffness of around 190 Pa, suggesting that organoid 
formation in a minimal environment occurs in a narrow mechanical 
window. Immunofluorescence analysis demonstrated the presence 
of differentiated cells within organoids grown in minimal matrices 
(Fig. 4d). As predicted, at the time of differentiation, the extent of YAP 
activation was significantly higher in softening matrices, compared 
with those of constant, high stiffness (Extended Data Fig. 5j). We 
suggest that this difference accounts for the separate morphogenetic 
outcomes within the two matrices. 

Finally, to test whether these matrices are also suitable for the culture 
of clinically relevant human counterparts, we embedded human small 
intestinal and colorectal cancer organoids in PEG RGD matrices. As 
reported before, growth was less vigorous than that of mouse cells, yet 
most of the embedded organoids survived and continued to expand 
(Extended Data Fig. 5k, 1), suggesting that the PEG-based hydrogels 
reported here can be adapted for the culture of human organoids. 

We report the first, to our knowledge, synthetic matrix for the expan- 
sion of mouse and human ISCs and a minimal matrix for the culture 
of mouse intestinal organoids. In addition to a final product with a 
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formation within matrices of varying softening rates. d, Organoids formed 
in PEG hydrogels contain differentiated intestinal cells. e, Summary of the 
rules governing different stages of organoid formation within an artificial 
matrix. Scale bars, 501m. 


wide range of applications in research and therapy, the design process 
provides mechanistic insight into how the 3D microenvironment 
influences multiple aspects of organoid formation, including ISC 
proliferation, fate decisions and self-organization. YAP has been 
implicated in the regulation of ISC behaviour, and described as a 
mechanotransducer in other cellular systems—here we bridge these 
roles, demonstrating that the YAP-mediated control of ISC self-renewal 
can have a mechanical upstream component. Notably, elucidating the 
role of ECM components and physical parameters on ISC behaviour 
was made possible by the modular nature of the synthetic hydrogels. 

In creating a well-defined matrix for intestinal organoids, we identi- 
fied general design principles that may be adopted in building synthetic 
scaffolds for different organoid systems. We found that, within a minimal 
environment, separate stages of the organoid formation process require 
different microenvironments (Fig. 4e). Indeed, the native cellular 
environment evolves to accommodate the dynamic needs of tissues 
during development, homeostasis and repair. Mimicking the dynamic 
character of native microenvironments in designer matrices may not 
only replace animal-derived gels for organoid culture, but may also help 
to enhance the similarity of organoids to real organs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. The experiments were not randomized and the investigators were 
not blinded to allocation during experiments and outcome assessment. No statis- 
tical methods were used to determine sample size. 

Mice. Intestinal crypts were extracted from 5-10-week-old heterozygous 
Lgr5-eGFP-IRES-CreERT2 mice (Jackson Laboratory), following animal exper- 
imentation protocols prescribed by EPFL and FELASA. 

Intestinal crypt isolation. Murine intestinal crypts were isolated following proce- 
dures described previously. In brief, the proximal part of the intestine was collected, 
opened longitudinally and washed with ice-cold PBS. The luminal side of the 
intestine was scraped using a glass slide to remove luminal content and villous 
structures. After washing with ice-cold PBS again, the intestine was cut into 
2-4mm pieces with scissors. The pieces were transferred to a 50 ml Falcon tube 
and further washed with cold PBS (5-10 times) with gentle vortexing. Intestinal 
fragments were then incubated in 20mM EDTA/PBS for 20 min on ice. EDTA was 
removed, 10 ml of cold PBS was added and crypts were released by manual shaking 
of the suspension for 5 min. The supernatant was collected and passed through 
a 70-\um strainer (BD Biosciences). The remaining tissue fragments were resus- 
pended in 10 ml cold PBS, triturated 5-10 times and the supernatant was passed 
through a 70-\1m strainer. The previous step was repeated a second time. The 
three crypt-containing fractions were combined and centrifuged at 110g for 5 min. 
The pellet was resuspended in 10 ml cold Advanced DMEM/F12 (Invitrogen) and 
centrifuged at 84g to remove single cells and tissue debris. The resulting pellet was 
enriched in crypts, which were subsequently dissociated or directly embedded in 
PEG gels or in Matrigel (BD Biosciences; growth factor reduced, phenol red-free 
formulation). When needed, crypts or ISC colonies were dissociated enzymatically 
by incubating for 8 min at 37°C in 1 ml TrypLE Express (Life Technologies), sup- 
plemented with DNase I (2000 U ml}; Roche), 0.5mM N-acetylcysteine (Sigma) 
and 101M Y27632 (Stemgent). Undigested clusters were removed by passing the 
suspension through a 40 1m strainer. 

Cell culture. Freshly isolated mouse crypts or single cells from dissociated mouse 
ISC colonies were embedded in Matrigel or PEG gels, which were cast into 20-11 
droplets at the bottom of wells in 24-well plate. Following polymerization (15 min, 
37°C), the gels were overlaid with 500 :l of ISC expansion medium (Advanced 
DMEM/F12 containing Glutamax, HEPES, penicillin-streptomycin, B27, N2 
(Invitrogen) and 11M N-acetylcysteine (Sigma)), supplemented with growth fac- 
tors, including EGF (50ng ml “!; R&D), Noggin (100 ng ml; produced in-house) 
and R-spondin (500 ng ml}; produced in-house), and small molecules, including 
CHIR99021 (341M; Millipore) and valproic acid (1 mM; Sigma). For single-cell 
culture, thiazovivin (2.5 1M; Stemgent) was included in the medium during the 
first two days. To induce stem cell differentiation and organoid formation, the 
medium was removed, the gels were washed with PBS and fresh medium contain- 
ing EGE, Noggin and R-spondin was added. Human small intestinal and colorectal 
cancer organoids were generated as described previously*!” and grown in 20-1 
droplets of Matrigel or PEG gels overlaid with Advanced DMEM/F12 contain- 
ing Glutamax, HEPES, penicillin-streptomycin, B27 (Life Technologies), Wnt3a 
(50% conditioned medium; produced in-house; only for small intestinal orga- 
noids), R-spondin 1 (20% conditioned medium; produced in-house), Noggin 
(10% conditioned medium; produced in-house), N-acetylcysteine (2 |1M; Sigma), 
Nicotinamide (10 mM; Sigma), human EGF (50 ng ml~!; Peprotech), A83-01 
(500 nM; Tocris), $B202190 (10|1M; Sigma), Prostaglandin E2 (10 nM; Tocris); 
Gastrin (10 nM; Tocris), and Y-27632 (10j.M; Abmole). In general, growth factors 
were replenished every two days, with full medium change taking place every four 
days. Where indicated, the following compounds were used at the specified con- 
centrations: blebbistatin (Sigma, 12.5 1M), ML7 (Calbiochem, 101M), cytochalasin 
D (Merck-Millipore, 0.1 1g m7), echistatin (Sigma, 500 nM). 

PEG, peptides and synthesis of hydrogel precursors. Vinylsulfone- 
functionalized 8-arm PEG (8-arm PEG-VS or sPEG) was purchased from 
NOE and acrylate-functionalized 8-arm PEG (8-arm PEG-Acr or dPEG) was 
purchased from Creative PEGWorks. The transglutaminase (TG) factor XHI 
(FXIIa) substrate peptides Ac- FKGGGPQGIWGQ-ERCG-NH2 (TG-DG-Lys), 
Ac-FKGG-GDQGIAGF-ERCG-NH2 (TG-NDG-Lys) and H-NQEQVSPL- 
ERCGNH2 (TG-Gln) and the RGD-presenting adhesion peptide H-NQEQVSPL- 
RGDSPG-NH2 (TG-Gln-RGD) were purchased from GL Biochem. To couple 
the FXII]a substrate peptides to the 8-arm PEG-VS or 8-arm PEG-Acr, they were 
mixed with the PEG powder in a 1.2 stoichiometric excess (peptide-to- VS group); 
the combined solids were dissolved in triethanolamine (0.3 M, pH 8.0), and 
allowed to react for 2h at 37°C. The reaction solution was dialysed (Snake Skin, 
MWCO 10K, PIERCE) against ultrapure water for 3 days at 4°C, after which the 
five products ((PEG-VS)-DG-Lys, (PEG-VS)-NDG-Lys, (PEG-VS)-Gln, (PEG- 
Acr)-NDG-Lys, (PEG-Acr)-Gln) were lyophilized. The resulting solid precursors 
were dissolved in ultra-pure water to make 13.33% w/v stock solutions. 
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Formation and dissociation of PEG hydrogels. Appropriate volumes of 13.33% 
w/v PEG precursor solutions were mixed in stoichiometrically balanced ratios to 
generate hydrogel networks of a desired final PEG content. Hydrogel formation 
was triggered by the addition of thrombin-activated FXIIIa (10 U ml~’; Galexis) 
in the presence of Tris-buffered saline (TBS; 50 mM, pH 7.6) and 50mM CaCh. 
The spare reaction volume was used for the incorporation of dissociated ISCs, 
fragments of human small intestinal or colorectal cancer organoids, and ECM 
components: TG-RGD-Gln, fibronectin (0.5 mg ml~ I, Invitrogen), laminin-111 
(0.1 mg ml“ |; Invitrogen), collagen-IV (0.25 mg ml~'; BD Bioscience), hya- 
luronic acid (0.5 mg ml; gift from D. Ossipov, Uppsala University), perlecan 
(0.05 mg ml '; Sigma). Gels were allowed to crosslink by incubating at 37 °C for 
15 min. Dissociation and release of colonies grown in PEG for downstream cell 
processing or re-embedding was accomplished by enzymatic digestion of the 
gels. Gels were carefully detached from the bottom of the plate using the tip of a 
metal spatula and transferred to a 15-ml Falcon tube containing 1 ml of TrypLE 
Express (Life Technologies), supplemented with DNase I (2,000 U ml}; Roche), 
0.5mM N-acetylcysteine (Sigma) and 10|1M Y27632 (Stemgent). Following diges- 
tion (10 min, 37°C), the cell suspension was washed with 10 ml of cold medium, 
passed through a 40-|1m strainer (BD Biosciences) and centrifuged at 1,200 r-p.m. 
for 5 min. 

Softening of PEG hydrogels through ester-based hydrolysis. To form mechan- 
ically dynamic PEG hydrogels, which underwent varying extents of spontaneous 
softening, hybrid hydrogels were formed from both PEG- VS and PEG-Acr hydrogel 
precursors. Specifically, to form a fast-softening 100% Acr gel, stoichiometric 
quantities of (PEG-Acr)-NDG-Lys and (PEG-Acr)-Gln precursors were allowed to 
crosslink. A slow-softening 50% Acr gel was formed by crosslinking stoichiometric 
amounts of (PEG-VS)-NDG-Lys and (PEG-Acr)-Gln precursors. A 75% Acr gel 
with intermediate kinetics of softening was formed by crosslinking the (PEG-Acr)- 
Gln precursor with half of the stoichiometric equivalent of (PEG-VS)-NDG-Lys 
and half of the stoichiometric equivalent of (PEG-Acr)-NDG-Lys. Regardless of the 
relative proportions of (PEG-VS) and (PEG-Acr) precursors within the hydrogel, 
its overall PEG content was varied to tune its initial mechanical properties. It 
should be noted that, by providing an initially stiff and later a soft environment, 
the mechanically dynamic matrices support both ISC expansion and organoid 
formation. Hence, ISCs can be expanded and organoids can be formed in the 
same hydrogel. 

PEG-alginate hydrogels for controlled matrix softening. Hybrid PEG-alginate 
(PEG-alg) gels were employed to induce a controlled drop in stiffness at a desired 
time. PEG-alg gels were formed by the simultaneous presence of activated FXII 
enzyme—to drive the crosslinking of the PEG macromers—and Ca** ions, which 
induce the crosslinking of the alginate polysaccharides. Hybrid gels were formed 
by casting a solution, containing 2% (w/v) of stoichiometrically balanced (PEG- 
VS)-NDG-Lys and (PEG-VS)-Gln precursors, 10mM TG-RGD-Gln, 10U ml! 
FXIIIa, 0.8% (w/v) alginate (Sigma) and dissociated ISCs, within a 1% agarose/2% 
gelatin mould, containing 20mM CaCl. The solution was incubated at 37°C for 
15 min, carefully de-molded from the agarose substratum and transferred to a 
12-well plate containing 1 ml of complete ISC expansion medium. Matrix softening 
was induced at the desired time by adding 1 U ml”! alginate lyase (Sigma), and 
incubating for 1h at 37°C. The digested gels were washed and transferred to freshly 
prepared ISC expansion medium. 

Mechanical characterization of PEG hydrogels. The shear modulus of the PEG 
gels was determined by performing small-strain oscillatory shear measurements 
ona Bohlin CVO 120 rheometer. In brief, preformed hydrogel discs 1-1.4mm in 
thickness were allowed to swell in complete cell culture medium for at least 3h, 
and were subsequently sandwiched between the parallel plates of the rheometer. 
The mechanical response of the gels was recorded by performing frequency sweep 
(0.1-10 Hz) measurements in a constant strain (0.05) mode, at 37°C. The shear 
modulus (G’) is reported as a measure of gel mechanical properties. 
Quantification of ISC colony formation efficiency, ISC colony shape, morphol- 
ogy and Lgr5 expression. To quantify the colony formation efficiency of single 
embedded ISCs, phase contrast z-stacks spanning the entire thickness of the cell- 
laden Matrigel or PEG gels were collected (Zeiss Axio Observer Z1) at 5 different 
locations within the gels. The Cell Counter plugin in Image] (NIH) was used to 
quantify the fraction of cells which had formed colonies at day 4 after seeding. To 
quantify colony circularity, phase contrast images of ISC colonies grown in the 
condition of interest (between 5 and 38 colonies per condition) were taken, and 
their contours traced manually in ImageJ. The circularity of the contours was meas- 
ured using the Measure algorithm in Image]. To characterize the cell morphology 
within ISC colonies grown in different conditions, phase contrast images of at least 
50 colonies were taken and the numbers of colonies containing packed columnar 
cells versus spread cells were counted. To quantify Lgr5 expression within ISC 
colonies grown within different matrices, fluorescence images of at least 50 colonies 
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per condition were recorded and the number of colonies expressing Lgr5-eGFP 
was counted. 

Laminin peptide hydrogels. To identify a short sequence that supports intestinal 
organoid culture, we created a library of soft (G' =200 Pa) hydrogels in which binding 
sequences from the laminin «1 subunit previously shown to be biofunctional**** 
(Extended Data Table 4) were tethered to the PEG backbone. Embedding frag- 
ments of pre-formed organoids and screening the library for organoid survival and 
growth revealed that two laminin-derived peptides—AG73 and A55—significantly 
enhanced organoid viability and supported further growth (Extended Data 
Fig. 5a). Presenting these two sequences (A55 and AG73) alongside in the same 
gel did not appear to have an additive effect, likely owing to a redundant adhesion 
mechanism. Hence, we focused on the sequence with a stronger individual effect, 
that is, AG73 and the corresponding PEG gels (referred to as TG PEG-AG73). 
Varying the amount of AG73 peptide tethered to the PEG gel backbone revealed 
a dose-dependent effect on intestinal organoid viability and growth (Extended 
Data Fig. 5b, c). Despite the improved rate of survival and morphogenesis in 
TG PEG-AG73 matrices compared with plain PEG or PEG RGD, the process 
was significantly less efficient compared with Matrigel, and morphological 
differences were apparent. Keeping in mind that the effect of AG73 was concen- 
tration-dependent, we attributed these differences to a potentially sub-optimal 
AG73 ligand density within the synthetic system. By design, there is an upper 
limit to the concentration of tethered factors that can be incorporated into the 
PEG system used thus far in the study: exceeding this limit disrupts the structural 
integrity of the gels. To overcome this limitation and enhance the biofunctionality 
of the synthetic matrix by increasing the concentration of AG73 ligands, we turned 
to chemically crosslinked PEG gels. Here, vinyl sulfone (VS)-conjugated 4-arm 
PEG precursors are covalently linked into solid hydrogels through Michael-type 
addition between VS groups and the thiols of a short crosslinker containing two 
cysteine residues. To incorporate the AG73 ligand at a high density, we designed 
a crosslinker in which the AG73 sequence was flanked by two short cysteine- 
containing sequences. The resulting gels (hereafter referred to as MT PEG-AG73) 
presented the AG73 ligand at a concentration of 3.1 mM, thus significantly 
surpassing the highest concentration achieved in the enzymatically crosslinked 
matrices. Embedding intestinal organoid fragments into MT PEG-AG73 revealed 
that the percentage of tissues that remained viable and continued to undergo 
morphogenesis approached that observed in Matrigel (Extended Data Fig. 5e). To 
verify the maintenance of ISCs within the organoids grown in MT PEG-AG73, we 
embedded tissues extracted from the Lgr5-eGFP reporter mouse and monitored 
eGFP expression. We observed that Lgr5-eGFP was expressed in the expected 
pattern: localized to the crypt-like buds of the organoids (Extended Data Fig. 5f). 
The fraction of organoids expressing the marker was significantly higher than in 
those cultured in TG PEG AG73, and at least as high as in organoids cultured in 
Matrigel (Extended Data Fig. 5g). We also confirmed that the organoids cultured 
in MT PEG-AG73 were polarized and contained differentiated cells (Extended 
Data Fig. 5h). 

Immunofluorescence analysis. ISC colonies or organoids embedded in Matrigel 
or PEG gels were fixed with 4% paraformaldehyde (PFA) in PBS (30 min, room 
temperature). The fixation process typically led to complete degradation of the 
Matrigel. Suspended tissues were collected and centrifuged (800 r.p.m., 5 min) 
to remove the PFA, washed with ultra-pure water and pelleted. Following resus- 
pension in water, the organoids were spread on glass slides and allowed to attach 
by drying. Attached organoids were rehydrated with PBS. Following fixation, 
organoids embedded in PEG or spread on glass were permeabilized with 0.2% 
Triton X-100 in PBS (1h, room temperature) and blocked (10% goat serum in 
PBS containing 0.01% Triton X-100) for at least 3h. Samples were subsequently 
incubated overnight at 4°C with phalloidin-Alexa 546 (Invitrogen) and primary 
antibodies against lysozyme (1:50; Thermo Scientific PA1-29680), mucin-2 (1:50; 
Santa Cruz sc-15334), chromogranin-A (1:50; Santa Cruz sc-13090), L-FABP 
(1:50; Santa Cruz sc-50380) and YAP1 (1:50; Santa Cruz sc-101199) diluted in 
blocking buffer. After washing with PBS for at least 3h, samples were incubated 
overnight at 4°C with secondary antibody Alexa 647 goat-c-rabbit (1:1000 in 
blocking solution; Invitrogen). Following extensive washing, stained organoids 
were imaged in epifluorescence (Zeiss Axio Observer Z1) or confocal (Zeiss 
LSM 710) mode. Alternatively, ISC colonies or organoids cultured in PEG were 
released from the hydrogel before PFA fixation, by incubating the gels with 
1mg ml! Dispase (Gibco) for 7 min at 37°C. The released colonies or organoids 
were fixed with PFA in suspension, and attached to glass coverslips, as described 
above. 

Immunohistochemical analysis of human organoids. Human organoids were 
fixed in 10% neutral buffered formalin, washed with PBS, dehydrated, and 
embedded in paraffin. Sections were stained with H&E or Ki67 antibody (1:250; 
Monosan). 


Lentivirus production and transduction. Lentiviral particles encoding for shRNA 
recognizing YAP (two sequences validated for knockdown, purchased from Sigma) 
or the pLKO.1-puro Non-Target control shRNA (Sigma) were generated in HEK 
293T cells, using third generation lentivirus packaging vectors. Transfection was 
carried out using the X-tremeGENE HP Transfection kit (Roche). After 48h, the 
supernatant was collected, filtered and ultracentrifuged at 50,000g for 2h at 20°C. 
The resulting pellet was resuspended in PBS and stored at —80°C. Lentiviral infec- 
tion of ISCs was performed by dissociating the ISC colonies (described above), 
resuspending the resulting single cells in ice-cold liquid Matrigel, containing 
10\M Y27632 and the concentrated lentiviral particles at a dilution of 1:10. ISCs 
were incubated with viral particles in a liquid suspension for 45 min on ice. The 
suspension was subsequently cast into droplets and allowed to form gels, which 
were overlaid with ISC expansion medium. The embedded ISCs proceeded to 
form colonies, which carried the transgene encoded by the virus. The cells were 
allowed to recover and form colonies for 36h, after which they were dissociated 
and encapsulated within PEG hydrogels or used for quantification of knockdown 
efficiency by qPCR. 

Quantitative real-time PCR (qPCR). ISC colonies or organoids grown in 
Matrigel or PEG gels were dissociated as described above, and RNA was extracted 
using an RNeasy Micro Kit (Qiagen). cDNA was synthesized using the iScript 
cDNA Synthesis Kit (Bio-Rad). qPCR was carried out using the Power SYBR 
Green PCR Master Mix (Applied Biosystems) and the primers listed Extended 
Data Table 1. 

RNA-seq: library preparation, sequencing, data processing, statistical analysis, 
interpretation and accession numbers. RNA-seq libraries were prepared using 
500 ng of total RNA and the Illumina TruSeq Stranded mRNA reagents (Illumina; 
San Diego, California, USA) on a Sciclone liquid handling robot (PerkinElmer; 
Waltham, Massachusetts, USA) using a PerkinElmer-developed automated script. 
Cluster generation was performed with the resulting libraries using the lumina 
TruSeq SR Cluster Kit v4 reagents and sequenced on the Illumina HiSeq 2500 using 
TruSeq SBS Kit v4 reagents. Sequencing data were processed using the Illumina 
Pipeline Software version 1.84. Purity-filtered reads were adapters and quality 
trimmed with Cutadapt and filtered for low complexity with seq_crumbs (v. 0.1.8). 
Reads were aligned against the Mus musculus.GRCm38.82 genome using STAR™. 
The number of read counts per gene locus was summarized with htseq-count*® 
using the Mus musculus.GRCm38.82 gene annotation. Quality of the RNA-seq data 
alignment was assessed using RSeQC”. Reads were also aligned to the Mus mus- 
culus.GRCm38.82 transcriptome using STAR and the estimation of the isoforms 
abundance was computed using RSEM™. Statistical analysis was performed for 
genes in the R software package. Genes with low counts were filtered out according 
to the rule of 1 count per million in at least 1 sample. Library sizes were scaled 
using TMM normalization in the EdgeR package® and log-transformed with 
the limma voom function”. Differential expression was computed with limma*! 
by fitting paired samples data into a linear model and performing all pairwise 
comparisons. To control for false discovery and multiple testing, we computed 
an adjusted P value, using the Benjamini-Hochberg method. Gene set expression 
analysis was performed with the freely available GSEA software” (Broad Institute), 
using differential expression values and pre-defined gene signatures as inputs. In 
particular, to test for upregulation of stress-related genes, the MSigDB gene set 
‘BIOCARTA_STRESS_PATHWAY’ was used. We checked for upregulation of 
colon cancer-related genes by using the inflammatory colon cancer signature, 
as identified by Sadanandam et al.**. Functional annotation and gene ontology 
analysis of significantly enriched gene sets was conducted using the MetaCore 
software. The accession numbers for the gene expression profiles described here 
are GEO: GSE85391. 

Statistical analysis and sample information. Statistically significant differences 
between the means of two groups were assessed by using a Student's t-test, whereas 
data containing more than two experimental groups were analysed with a one- 
way ANOVA followed by a Bonferroni’s multiple comparison test. All statistical 
analyses were performed in the GraphPad Prism 6.0 software. 

Data availability. RNA sequencing data that support the findings of this paper 
have been deposited to the Gene Expression Omnibus (GEO) public repository 
(GSE85391; https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi? acc=GSE85391). 
Source Data for Figs. 1-4 and Extended Data Figs 1-5 are provided with the paper. 
All additional relevant data are available upon request from the corresponding 
author. 
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Extended Data Figure 1 | Primary crypt culture in synthetic matrices, 
characterization of ISCs grown in PEG RGD culture with or without 
CHIR99021 and valproic acid. a, ISCs cultured in Matrigel and 
unmodified PEG gels for 24h. b, ISC colonies formed from freshly isolated 
mouse intestinal crypts embedded in PEG RGD. ¢, Relative mRNA levels 
of intestinal genes, quantified by qPCR. d, ISC colonies cultured under 


self-renewal conditions in Matrigel, but not PEG gels, contain lysozyme- 
expressing Paneth cells. e, Stiffness-dependent colony formation and 
quantification of ISCs cultured with EGF, noggin, R-spondin and Wnt3a. 
Graphs show individual data points derived from n =3 independent 
experiments and means. Scale bars, 501m. 
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Extended Data Figure 2 | Effect of controlled matrix softening on ISC are represented as mean +s.e.m. e, Alginate lyase treatment does not 
colony morphology, YAP activity and growth. a, Mechanical properties affect colony growth in PEG RGD gels. Softening of PEG-alg gels by 
of control and softened PEG alginate (PEG-alg) hybrid gels obtained by alginate-lyase-mediated digestion blocks colony growth. f, Quantification 
selective degradation of the alginate network. Graph shows individual of shRNA-mediated knockdown of YAP. Graph shows individual data 
data points derived from 3 independently prepared gels. b, Morphology points derived from 2 independent experiments and means. g, Effect 
of day 1 colonies in control and softened PEG-alg gels. c, d, Distribution of verteporfin on ISC colony formation in PEG RGD. Graph shows 
of YAP in day 1 colonies in control and softened PEG-alg gels (c) and individual data points derived from 3 independent experiments and 
quantification (d). n = 28 colonies (control) and n = 31 (softened). Data means.*P < 0.05; **P < 0.01; ***P < 0.01. Scale bars, 501m. 
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Extended Data Figure 3 | Stiff degradable matrices induce an (see Methods for details). b, Functional annotation of signalling pathway 
inflammation-like state in ISCs. a, Gene set enrichment analysis (GSEA) significantly upregulated in degradable matrices (details and statistics 
comparing RNA-seq gene expression data of ISCs cultured in degradable shown in Extended Data Tables 2, 3). 

compared to non-degradable matrices to published gene signatures 
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Extended Data Figure 4 | Comparison of organoid formation and 
YAP activity in Matrigel and PEG RGD. a, Organoid formation does 
not occur within synthetic PEG RGD matrices. ISC colonies cultured 

in non-degradable (PEG N-DG) and degradable (PEG DG) PEG RGD 
matrices, with or without GM6001 for 4 d, and subsequently cultured 
under organoid formation conditions for 2 d. b, Time-course analysis of 
morphology and Lgr5-eGFP expression during organoid formation. ISC 
colonies formed in Matrigel and PEG hydrogels were monitored for 48 h, 
following a switch to differentiation and organoid formation conditions. 
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c, Localization of YAP at day 1 and day 4 in ISC colonies formed in 300 Pa 
and 1.3 kPa PEG RGD gels. d, Quantification of nuclear translocation 

of YAP as a function of time in 300 Pa, 1.3 kPa PEG RGD and Matrigel. 
Data are shown as means +s.e.m. To assess the extent of YAP nuclear 
translocation, n = 21 (day 1, 1.3 kPa); 27 (day 2, 1.3 kPa); 22 (day 3, 

1.3 kPa); 22 (day 4, 1.3 kPa); 27 (day 1, 300 Pa); 6 (day 2, 300 Pa); 23 (day 
3, 300 Pa); 22 (day 4, 300 Pa); 30 (day 1, Matrigel); 28 (day 2, Matrigel); 

30 (day 3, Matrigel); 30 (day 4, Matrigel). *P < 0.05; **P < 0.01; 

*** P< (0.001. Scale bars, 50\1m. 
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Extended Data Figure 5 | Effect of laminin-derived peptides on 
organoid formation, and human organoid expansion in PEG RGD. 

a, Effect of different laminin-111-derived sequences on intestinal organoid 
viability. b, Morphology of intestinal organoids grown in Matrigel, plain 
PEG or PEG-AG73 gels. c, AG73-conjugated PEG matrices significantly 
enhance the growth of intestinal organoids. d, Effect of AG73 on 
intestinal organoid viability and growth is concentration-dependent. 

e, Quantification of intestinal organoid viability in Matrigel, TG 
PEG-AG73 and MT PEG-AG73. f, Morphology and Lgr5-eGFP 
expression in intestinal organoids grown in Matrigel and MT PEG-AG73 
gels. g, Quantification of Lgr5-eGFP expression in intestinal organoids 
expanded in Matrigel, TG PEG-AG73 and MT PEG-AG73. 

h, Establishment of apicobasal polarity and presence of Paneth (lysozyme) 
cells within organoids grown in MT PEG-AG73. i, AG73 peptides are 
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not capable of supporting differentiation and organoid formation from 
ISC colonies, whereas full-length laminin-111 is. j, Quantification of 
nuclear translocation of YAP as a function of time in cells cultiured within 
mechanically stable and softening gels. Data are shown as means + s.e.m. 
To assess the extent of YAP nuclear translocation, n = 21 (day 1, 100% 
sPEG), 27 (day 2, 100% sPEG), 22 (day 3, 100% sPEG), 22 (day 4, 100% 
sPEG), 30 (day 1, 75% dPEG), 30 (day 2, 75% dPEG), 30 (day 3, 75% 
dPEG), and 30 (day 4, 75% dPEG) colonies were analysed. k, 1, Phase 
contrast images (k) and Ki67 expression (1) of human ISC colonies and 
human patient-derived colorectal cancer organoids grown in PEG RGD. 
Graphs show individual data points derived from n =2 (a), n=3 (c-e, g) 
independent experiments and means. *P < 0.05; **P < 0.01; ***P< 0.001. 
Scale bars, 501m. 
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Extended Data Table 1 | Primers used for qPCR analysis of intestinal gene expression 


Mouse gene 


Forward primer 
ATTCGGTGCATTTAGCTTGG 
GCCTATCTCTGTGGGGTCAA 
AAGGTGATGAAGTGCGTCCT 
CCGACTTCAACCCAAGTGAT 
GTCACACTTCCTCGCTTTCC 
ATCCTGCACCACCAACTGCT 
TCCTCCTTTGAGATCCCTGA 


Reverse primer 
CGAACACCTGCGTGAATATG 
TTTCTTGGCACGGTACATCA 
GGTGTCGCAGGATAGAGAGG 
GAGCAAGGGACTCTGGTCTG 
TGGCTTTGCTGACTGACAAG 
GGGCCATCCACAGTCTICTG 
GCCATGTTGTTGTCTGATCG 
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Extended Data Table 2 | Functional annotation of genes significantly upregulated in degradable PEG RGD matrices versus 
non-degradable PEG RGD matrices, in terms of cellular pathways 


Immune response_M |F-mediated glucocorticoid regulation 


Protein folding and maturation_Posttransiational processing of neuroendocrine peptides 


Immune response_Substance P-stimulated expression of proinflammatorycytokines vis MAPKs 


immune response_IL-tsignaling pathway 

Immune response_Histamine Htreceptor signaling in immune response 
Muscle contraction_Relasdn signaling pathway 

PDE4 reguistion of cyto/chemokine expression in inflammatoryskin diseases 
Apoptosis and survival_Role of PKR in stress-induced apoptosis 

Immune response_HM GB YRAGE signaling pathway 


Immune response_IL-17 signaling pathways 


Performed using the Metacore software package. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Table 3 | Functional annotation of genes significantly upregulated in degradable PEG RGD matrices versus non-degradable 
PEG RGD matrices, in terms of cellular processes 


Infammation_Histamine signaling 

inflammation_Neutro phil activation 

Inflammation_Innate inflammatoryresponse 
Development_Neurogenesis_Synaptogenesis 

Muscle contraction 

Cell adhesion_P latelet-endo thelium-euco cyte interactions 
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Muscle contraction_Relassn signaling 
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Immune response_Innate immune response to RNA viral infection 


Performed using the Metacore software package. 
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Extended Data Table 4 | Laminin-derived peptides 


Ac-NQEQVSPLREKRLOVOLSIRT-NH2 
Ac-NQEQVSPLNRWHSIVT RFG-NH2 
Ac-NQEQVSPLTWYKIAFORNRK-NH2 
Ac-NQEQVSPLGT NNVWVOSP SIQN-NH2 
Ac-NQEQVSPLROVFOVA YIIIKA-NH2 
Ac-NQEQVSPLGGFLKYTVSYDENH2 
Ac-NQEQVSPLRDOLM TVLANVT-NH2 
Ac-NQEQVSPLVOLRNGFP YFSY-NH2 
Ac-NQEQVSPLM EM QANLLLDRL-NH2 
Ac-NQEQVSPLAASIKVAVSADR-NH2 


Ac-NQEQVSPLAGT FALRGDNPQG-NH2 


Ac-NQEQVSPLATLOLOQEGRLHF(Nie)FDLGKGR-NH2 


Ac-GCRERKRLOVOLSIRTERCG-NH2 


See refs 33, 34. 
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Different tissue phagocytes sample apoptotic cells to 


direct distinct homeostasi 


Ryan J. Cummings!, Gaetan Barbet!’, Gerold Bongers!*, Boris M. Hartmann, Kyle Gettler*, Luciana Muniz! 


S programs 
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Glaucia C. Furtado!, Judy Cho!°, Sergio A. Lirab*” & J. Magarian Blander!?>)75+ 


Recognition and removal of apoptotic cells by professional 
phagocytes, including dendritic cells and macrophages, preserves 
immune self-tolerance and prevents chronic inflammation and 
autoimmune pathologies!”. The diverse array of phagocytes that 
reside within different tissues, combined with the necessarily 
prompt nature of apoptotic cell clearance, makes it difficult to 
study this process in situ. The full spectrum of functions executed 
by tissue-resident phagocytes in response to homeostatic apoptosis, 
therefore, remains unclear. Here we show that mouse apoptotic 
intestinal epithelial cells (IECs), which undergo continuous renewal 
to maintain optimal barrier and absorptive functions’, are not 
merely extruded to maintain homeostatic cell numbers‘, but are also 
sampled by a single subset of dendritic cells and two macrophage 
subsets within a well-characterized network of phagocytes in 
the small intestinal lamina propria®®. Characterization of the 
transcriptome within each subset before and after in situ sampling 
of apoptotic IECs revealed gene expression signatures unique to 
each phagocyte, including macrophage-specific lipid metabolism 
and amino acid catabolism, and a dendritic-cell-specific program 
of regulatory CD4* T-cell activation. A common ‘suppression of 
inflammation signature was noted, although the specific genes and 
pathways involved varied amongst dendritic cells and macrophages, 
reflecting specialized functions. Apoptotic IECs were trafficked to 
mesenteric lymph nodes exclusively by the dendritic cell subset 
and served as critical determinants for the induction of tolerogenic 
regulatory CD4t T-cell differentiation. Several of the genes that 
were differentially expressed by phagocytes bearing apoptotic 
IECs overlapped with susceptibility genes for inflammatory bowel 
disease’. Collectively, these findings provide new insights into the 
consequences of apoptotic cell sampling, advance our understanding 
of how homeostasis is maintained within the mucosa and set the 
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stage for development of novel therapeutics to alleviate chronic 
inflammatory diseases such as inflammatory bowel disease. 
Clearance of apoptotic epithelial cells within the respiratory, 
colonic and post-weaning mammary epithelium can be conducted 
by aptly positioned neighbouring epithelial cells, which serve as non- 
professional phagocytes’. To examine whether apoptotic IECs are 
also recognized by professional phagocytes within the small intestinal 
lamina propria (SILP), we generated mice that express transgenic 
diphtheria toxin receptor (DTR) fused to enhanced green fluorescent 
protein (eGFP), driven by the epithelium-specific villin promoter 
(VDTR mice). This enabled the experimental induction of apoptosis 
and allowed for tracking of apoptotic cell phagocytosis by acquisition 
of eGFP. The villin promoter drove transgene expression in IECs of the 
small and large intestine (Extended Data Fig. 1a, b). We observed no 
gross histological changes within the small or large intestine of VDTR 
mice relative to C57BL/6J (B6) controls (Extended Data Fig. Ic, d). 
Extensive eGFP expression co-localized with epithelial pan-cytokeratin 
and the actin cytoskeletal binding agent phalloidin throughout 
the small and large intestinal epithelia (Fig. la and Extended Data 
Fig. le-i). Injection of VDTR mice with 10 ng g! diphtheria toxin 
induced IEC death throughout the villi; dying IECs in control mice 
injected with phosphate buffered saline (PBS) were noted only at villi 
tips, characteristic of natural IEC turnover? (Extended Data Fig. 1)). 
Lowering the dose of diphtheria toxin to 2ng g~' showed no 
evidence of causing epithelial erosion, villus atrophy or inflammatory 
cell infiltration over time (Extended Data Figs Ic, right versus mid- 
dle panels, 2a, b). Expression of inflammatory II6, Il11b and Tuf genes 
was not induced in the ileum 4h after administration of either 2 or 
10ng g~' of diphtheria toxin. However, upregulation of these pro-in- 
flammatory genes was observed 16h after administration of 10ng g~! 
diphtheria toxin (Fig. 1b). We observed no bacterial translocation to 
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Figure 1 | A novel mouse model for inducing apoptosis of IECs under 
non-inflammatory conditions. a, Immunofluorescence for indicated 
markers on small intestine cryo-sections. b, (RT-PCR on VDTR ileum 
represents at least four independent experiments in duplicate. n = 4 mice 
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per group. One-way ANOVA; **P <0.01, *P< 0.05. NS, not significant. 
Data are mean +s.e.m. c, Immunofluorescence for cleaved caspase 3 
(CC3) on small intestine paraffin sections 4h after administration of 2 or 
10ng gl diphtheria toxin (DT). Scale bars, 50 (a) and 100 (c) pm. 
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Figure 2 | A single CD103* dendritic cell subset and two macrophage 
subsets sample apoptotic intestinal epithelial cells. a, b, Conventional 
(without eGFP overlaid) (a) and confocal (b) whole-mount microscopy 

of VDTR ileum. Scale bars, 100 j1m. Arrowheads in a indicate phagocytes 
near CC3* IECs. L, lumen. c, Flow cytometry analysis of total (grey) and 
eGFP* (green) phagocytes pre-gated on live CD45*MHCII"CD1 Ic* cells. 
d, Percentage and absolute number of phagocytes from c. e, ImageStream 


the intestinal lamina propria after treatment with either dose of diph- 
theria toxin for 4 h, in contrast to 10ng g! diphtheria toxin at 24h or 
with 3% dextran sodium sulphate (DSS) (Extended Data Fig. 2c, d). 
Staining for cleaved caspase-3 (CC3), a marker of early apoptosis, was 
significantly increased in a dose-dependent manner within the terminal 
ileum of diphtheria-toxin-treated compared to PBS-treated VDTR mice 
(Fig. 1c and Extended Data Fig. 2e, f). We thus chose 2 ng g~! as the 
diphtheria toxin dose concentration that would increase the likelihood 
of observing phagocytic sampling of apoptotic IECs without eliciting 
inflammation or epithelial barrier disruption. 

Using whole-mount microscopy on excised small intestine tissue, 
we localized CC3 labelling to eGFP* IECs and CD11c expression to 
phagocytes, which appeared centrally within villi and proximally to the 
CD31* vasculature (Extended Data Fig. 3a, b). We detected numerous 
CC3* IECs in the small intestine of diphtheria-toxin-treated VDTR 
mice (Fig. 2a and Extended Data Fig. 3c). CD11c* cells extended 
dendrites towards the base of apoptotic IECs upon diphtheria toxin 
treatment, into the space of the eGFP signal and not the lumen 
(as happens during soluble antigen retrieval®) (Fig. 2a and Extended 
Data Fig. 3d). Some dendrites ended with globular shapes, charac- 
teristic of phagocyte sampling® (Extended Data Fig. 3e), and 
were observed in the same optical confocal section as CC3* IECs 
(Fig. 2b and Extended Data Fig. 3f). CC3* IECs were also increased in 
diphtheria-toxin-treated mice (Fig. 2b) relative to control (Extended Data 
Fig. 3g, h), consistent with the quantification in Extended Data Fig. 2f. 

We monitored apoptotic IEC sampling by tracking the appearance 
of eGFP within phagocytes. We set sub-gates by flow cytometry 
within haematopoietic SILP cells (Extended Data Fig. 4a) from VDTR 
mice, compared to B6 mice (Extended Data Fig. 4b). All intestinal 
CD45*+MHCII*CD11c* phagocyte subsets, divided on the basis of 
surface expression of the integrins wE (CD103) and aM (CD11b)°, 
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CD11b 


CD64 


of SILP VDTR cells. Data represent three independent experiments. DIC, 
differential interference contrast microscopy. f, Flow cytometry analysis of 
total phagocytes with corresponding eGFP* cells. Data are mean +s.e.m., 
n=3 B6 mice; n=6 VDTR mice treated with 2 ng g! diphtheria toxin for 
1-4h; n=9 VDTR mice treated with PBS; one-way ANOVA; *P < 0.05; 
NS, not significant. Data in c, e, and f, represent SILP cells from VDTR 
mice, at least three independent experiments. 


contained the eGFP label in VDTR mice, regardless of diphtheria 
toxin treatment. This indicated constitutive apoptotic IEC sampling, 
which was not observed for monocytes and granulocytes (Extended 
Data Fig. 4b-e). The number of eGFP* phagocytes was significantly 
increased 4h after diphtheria toxin administration when compared 
to control (Extended Data Fig. 4f). Of all eGFP*+ phagocytes from 
diphtheria-toxin-treated mice (Extended Data Fig. 4c), CD103*CD11b* 
and CD103- CD11b* (CD11b*) phagocytes were significantly increased 
at 2h and 4h, respectively, relative to PBS-treated mice (Fig. 2c, d). The 
total number of phagocytes was not increased relative to PBS-treated 
mice, suggesting that the increase in eGFP* phagocytes was not due to 
an influx of cells, but rather an increase in phagocytes carrying apoptotic 
IEC cargo (Fig. 2c, d). ImageStream analyses showed eGFP* inclusions 
within MHCII*CD45*CD11c*Hoechst* phagocytes (Fig. 2e), indi- 
cating internalization of apoptotic bodies or degradation of formerly 
engulfed apoptotic IECs. 

Assessment of the expression of CD64, a marker of macrophages 
in the mouse intestine®, and CD24, expression of which defines 
dendritic cells®, revealed that CD103* phagocytes were predom- 
inately CD24*CD64-, defined as dendritic cells® (Fig. 2f). Minor 
populations of CD24*CD64~ dendritic cells were also present within 
the CD103*CD11b* and CD11b* phagocytes, which were otherwise 
predominantly comprised of CD24~ CD64* macrophages (Fig. 2f and 
Extended Data Fig. 4g). The CD64*CD103*CD11b* macrophage 
population was most notable as it has not previously been reported’, 
although its CD103 expression may be transient®. Of CD103*CD11b* 
and CD103~CD11b* phagocytes, only CD24 CD64* macrophages 
contained the eGFP label (eGFP*), whereas CD24*CD64~ dendritic 
cells did not (eGFP” )(Fig. 2fand Extended Data Fig. 4g). Notably, only 
the CD103* population of CD24*CD64~ dendritic cells were eGFP* 
(Fig. 2f and Extended Data Fig. 4g). Thus, of the five phenotypically 
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distinct SILP phagocytes, only CD11b~ CD103* (CD103*) dendritic 
cells and two macrophage subsets, CD103*CD11bt and CD11b*, 
sample apoptotic IECs (Fig. 2f and Extended Data Fig. 4h). 

We next determined the transcriptional profiles that result from apop- 
totic IEC sampling by isolating the eGFP* and equivalent eGFP~ popu- 
lations of each subset (Extended Data Fig. 5a) 4h after administration of 
2ng g' diphtheria toxin, conditions that maintain a non-inflammatory 
state (Fig. 1 and Extended Data Fig. 2). Principal component analysis of 
gene expression revealed distinct clustering into dendritic cells and mac- 
rophages, and further clustering based on eGFP content (Extended Data 
Fig. 5b-f). Comparing gene expression profiles of eGFP* and eGEP™ 
phagocytes revealed predominant transcriptional downregulation in 
CD103* dendritic cells and upregulation in macrophages after apoptotic 
IEC sampling (Fig. 3a). CD103* dendritic cells had the most differentially 
expressed genes (334) followed by CD103*CD11b* (122) and CD11bt 
(60) macrophages (Fig. 3a, b). Each phagocyte upregulated unique sets of 
genes in response to apoptotic IECs, and only two genes, Plac8 and Itgb7 
(Supplementary Information shows all gene names) were downregulated 
in all three of the groups, suggesting that apoptotic IEC sampling gener- 
ates unique transcriptomes within each phagocytic cell subset. 

Several unique pathways were induced in apoptotic IEC-bearing 
eGFP* macrophages, including those that facilitate internalization of 
cargo and clearance of apoptotic cells (Fig. 3c, d). CD103*CD11bt 
and CD11b* macrophages, which sampled the bulk of apoptotic IECs 
(Fig. 2c, d), significantly upregulated a distinct array of genes involved in 
apoptotic cell clearance, including Gas6, the scavenger receptor Cd163, 
and TAM family receptor Mertk (encoding MER)!, whereas CD103* 
dendritic cells upregulated the endocytic receptor Ly75 (encoding 
CD205), which detects apoptotic cell ligands® (Fig. 3d). Higher 
surface expression of MER in macrophages and CD205 in dendritic 
cells correlated with higher eGFP content, irrespective of diphtheria 
toxin treatment, validating the use of low doses of diphtheria toxin to 
increase apoptotic IEC numbers without major steady-state perturba- 
tions (Extended Data Fig. 6a). Other genes whose expression correlated 
with eGFP content were differentially modulated in macrophages and 
CD103* dendritic cells, including Cx3cr1, TAM family receptors Axl 
and Tyro3, and Stabilin family receptors Adgrb1 (also known as Bail) 
Stab1 and Timd4 (Extended Data Fig. 6b-e, confirming similar profiles 
at 6 h after diphtheria toxin administration), whereas Cd300a remained 
unchanged (Extended Data Fig. 6f). Both macrophage subsets 
expressed Pikfyve, which is essential for phagosome maturation’? 
(Fig. 3c, d). Increased expression of genes involved in lipid processing 
and branched-chain amino acid catabolism was notable in apoptotic 
IEC-bearing CD103*CD11b* and CD11b* macrophages, respectively 
(Fig. 3e), and may reflect the increased catabolism of internalized apop- 
totic cargo metabolites. CD103+CD11b* macrophages upregulated 
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Figure 3 | Sampling apoptotic 
IECs in situ triggers a 
transcriptional ‘suppression of 
inflammation’ signature distinct 
for macrophages (Md) and 
CD103* dendritic cells (DC). 
a, c-g, Hierarchical clustering 
of differentially expressed genes 
upregulated by at least 1.2 fold 
(ANOVA (q < 0.05) and Tukey’s 
honest significant difference (HSD) 
post-hoc test (P< 0.05); 

—1.2 > fold-change > 1.2); 
expressed by row Z scale and 
organized according to Gene 
acaasy Otology as indicated. b, Venn 

| a diagrams of genes in a. Data 
represent five independent 
experiments, four mice per 
experiment, three biological 
replicates. 


@ Lipid metabolism 
CD103CD11b Mb 
eGFP" eGFP* 


CD11b Mb 


eGFP-_eGEP* 


Branched-chain amino 
acid catabolism 


CD11b Mo 
eGFP- eGFP* 


CD103 DC 


Jkamp 


Entpd4 
CD11b Mm 
eGFP- 


eGFP* 
mira 


Htrat 


Thfrsf1 1a a 


Pon3 


Abcal expression, which increases cholesterol efflux and has been 
associated with engulfment of apoptotic cells and liver X receptor « 
function, which upregulates Mertk expression!. Similarly, CD11bt 
macrophages upregulated Acadsb (Fig. 3e and Extended Data Fig. 6d). 

Profiling genes that were downregulated in response to apoptotic IEC 
sampling revealed two notable transcriptional signatures: the down- 
regulation of inflammatory genes and suppression of the immune 
response (Fig. 3f and Extended Data Fig. 6g-i). Different sets of these 
genes were significantly downregulated in CD103*CD11bt and 
CD11b*eGFP* macrophages, including genes that encode the pattern 
recognition receptors Clec4a, Clec4b1, Cd209a and Tir2 (Fig. 3f). 
Alox5ap, which facilitates inflammatory leukotriene biosynthesis, 
was significantly downregulated in both CD103*CD11b* and 
CD11bteGFP* macrophages (Fig. 3f and Extended Data Fig. 6d, h) 
reflecting previous findings that show suppression of 5-lipoxygenase 
by apoptotic cells'’. CD103* dendritic cells also significantly 
downregulated several innate immune genes including Ti/r12, 
inflammasome-related Nirp3, Casp1 and Il1a'* (Fig. 3f), as well as 
MAPK/ERK signalling (Lrrk2, Map3k4, Fos) (Extended Data Fig. 6g, i) 
and Nirp3, Fos at 6 h after diphtheria toxin administration (Extended 
Data Fig. 6e). Classical and alternative complement genes Clqa and Cfp, 
respectively, were also downregulated in CD1037 dendritic cells (Fig. 3f), 
which may further dampen NLRP3 inflammasome activation”. 
Additionally, expression of Cfh, encoding a regulator of complement 
activation, was upregulated in eGFP*CD11b* macrophages (Fig. 3g 
and Extended Data Fig. 6i). Also notable was the significant upregula- 
tion of negative regulators of the immune response (Fig. 3g). CD103* 
dendritic cells upregulated: Nirc5 and Tnfaip3 (A20), which negatively 
regulate the NLRP3 inflammasome, type-I interferon (IFN) signalling 
and NF-«B activity'4; Oasl1, which inhibits IRF7, a type-I IFN tran- 
scription factor'®; and Spred1, which inhibits MAPK signalling!® 
(Extended Data Fig. 6i, e, at 6h after diphtheria toxin administration). 
Coupled with unchanged or downregulated gene expression profiles for 
inflammatory cytokines and co-stimulatory receptors in eGFP* relative 
toeGFP counterparts after diphtheria toxin treatment (Extended Data 
Fig. 7), the overall phagocytic response to apoptotic IEC sampling 
in situ results in genome-wide transcriptional suppression of a set of 
inflammatory genes that is unique to each CD11c* subset. 

In contrast to macrophages, eGFP+CD103* SILP dendritic cells 
showed a distinct transcriptional profile dominated by a ‘regulatory 
CD4* T-cell activation signature (Fig. 4a, Extended Data Fig. 6e 
at 6h post-diphtheria toxin). These dendritic cells significantly 
upregulated genes that are required for regulatory CD4* T-cell (Treg) 
recruitment, including chemokines Ccl22 and Ccl17 (ref. 17), and genes 
that promote T,.g-mediated suppression—including the leucine-rich 
membrane protein Lrrc32, which retains latent TGF3 on Treg cells!® 
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Table 1 | Apoptotic IEC sampling induces transcriptional modulation of IBD susceptibility genes in intestinal lamina propria phagocytes 


CD103*CD11bt M® CD103* dendritic cell 
Gene Regulation Gene Regulation Gene Regulation Gene Regulation 
112b up 112b up Sept1 down Hhex down 
Trpm2 up Lrre32 up Rassf5 down Cast down 
Tmem132a up Pfkfb3 up Ptpn1 down Nusap1 down 
Lsp1 down Gca up Kif11 down 1r2 down 
Rpl3 down Ccd40 up Gpr183 down CD11b*+ M® 
Mrp!20 down Tnfaip3 up Slc25a28 down Gene Regulation 
Ak3 down Spred1 up Fuca2 down M1rl2 up 
Ctsw down Lrrk2 down Mospd3 down Lsp1 down 
Sept1 down Plau down Ikzf1 down Differentially expressed 
Dclre1b down Mrpi20 down Psmd3 down Phagocyte Total Up Down 
$100a10 down Fos down Acyp1 down CD103* CD11bt M&S 13 3 10 
Tnni2 down Notch2 down Sultlal down CD103* DC 30 7 23 
S100a11 down Donson down Bcap29 down CD11b* M® 2 1 1 


Genes exhibiting upregulation or downregulation in each SILP phagocyte population upon apoptotic IEC sampling were overlapped with IBD susceptibility loci, which were downloaded from the GEO 
(accession number GSE57945, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE57945)*>. Genes in bold are shared. Total numbers of differentially expressed IBD genes is shown in the 


bottom right corner. M®, macrophage. 


(Fig. 4a). Aldh1a2, which catalyses the synthesis of retinoic acid 
to further generate T,.g in the periphery’’, was also significantly 
upregulated (Fig. 4a, Extended Data Fig. 8a), as were genes required 
for T,eg expansion including the suppressive co-stimulatory molecule 
Cd274 (encoding PD-L1)° (Fig. 4a), whose surface expression on 
eGFP*CD103* dendritic cells after diphtheria toxin administration 
was the same as that of untreated VDTR mice (Extended Data Fig. 8b). 
RelB and Cd40, involved in inducing Treg differentiation”°, were also 
significantly upregulated (Fig. 4a). Thus, apoptotic IEC sampling by 
SILP CD103* dendritic cells sets the stage for instructing differentia- 
tion of Treg cells, which are important mediators of intestinal tolerance 
and homeostasis!” 

Highest expression of the chemotactic receptor Ccr7 in the SILP was 
observed in eGFP*CD103* dendritic cells (Extended Data Fig. 8c, 6e 
at 6h), suggesting that CD103* dendritic cells function as migratory 
phagocytes that traffic apoptotic IEC cargo from the lamina propria to 
the mesenteric lymph nodes (MLN). Migratory cells within the MLN 
of VDTR and B6 mice were similar, but eGFPT phagocytes, which 
comprised approximately 10% of all migratory cells, were noted only in 
MLN of VDTR mice and without the need for treatment with diphtheria 
toxin, indicating the shuttling of apoptotic IEC cargo to the MLN 
under steady-state conditions (Fig. 4b and Extended Data Fig. 8d, e, 
B6 mice). Of the migratory subsets, only CD103*CD11b- CD24+ 
dendritic cells were eGFP*, comprising approximately 25% of all CD103* 
migratory cells (Fig. 4b and Extended Data Fig. 8e), whereas MLN- 
resident CD103* dendritic cells were eGFP~ (Extended Data Fig. 8f-k). 

Within the MLN, migratory eGFP*CD103~ dendritic cells had a 
transcriptional profile (Fig. 4c) similar to that of SILP eGFP*CD103* 
dendritic cells (Fig. 4a). Several transcripts affecting Tyeg cells were 
additionally increased in eGFP* relative to eGFP-CD103+ MLN 
dendritic cells, including Ido1 and II10 (ref. 20). Whereas levels of Tgfb1 
transcripts, which encode a protein that facilitates T,e, expansion”, 
were similar (Extended Data Fig. 81, m), those of Itgb8, which encodes 
a protein that activates TGF, were increased by approximately 10- and 
20-fold in eGFP* relative to eGFP~ CD103* MLN and SILP dendritic 
cells, respectively (Fig. 4c and Extended Data Fig. 6e). This equated 
to an approximately 4-fold increase from the SILP to the MLN. T-cell 
co-stimulatory receptors (CD274, CD40) (Fig. 4a, c and Extended 
Data Fig. 8b, n), and secreted signals (retinoic acid (via Aldh1a2), 
CCL22), were also increased relative to eGFP CD103+ MLN den- 
dritic cells (Fig. 4a, c). Consistent with this profile, eGEP*CD103* 
dendritic cells induced an approximately 4-fold increase in Treg cells 
ex vivo, as marked by the Treg-Specific transcription factor FOXP3, 
compared to eGFP~ counterparts (26.5% versus 6.6%, respectively) 
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and without added TGF® (Fig. 4d). No Treg differentiation occurred 
when TGF was added alone without dendritic cells (Fig. 4d). Unlike 
FOXP3~ T cells, these induced FOXP3+ Treg cells expressed the high- 
affinity IL-2 receptor CD25 (ref. 19; Fig. 4e). The number of Treg cells 
also increased in vivo following a single diphtheria toxin injection, 
as shown by increased CD45*CD3*CD4tFOXP3° peripherally 
induced Treg cells marked by lack of Helios expression or transcription 
factors specific for other T-helper cell subsets (Fig. 4f, g and Extended 
Data Fig. 9). Despite a predominant specificity of small intestine Treg cells 
for dietary antigens”', specificities for self-antigens remain possible. 
Thus, although CD103* dendritic cells migrate to MLN regardless of 
the content of apoptotic IECs (Fig. 4b), only CD103* dendritic cells 
carrying apoptotic IECs excelled at de novo Tyeg-cell differentiation. 
The transposition of differentially modulated genes with the 
163 susceptibility loci identified through genome-wide association 
studies on inflammatory bowel disease (IBD) showed modulation 
of 41 IBD susceptibility genes upon apoptotic IEC sampling, with the 
majority expressed in CD103* dendritic cells and some select genes 
shared between macrophages and the CD103* dendritic cells and 
CD103"CD11b™ macrophages (Table 1). Although no single gene 
was expressed in all three populations, CD103* dendritic cells and 
CD103*CD11bt macrophages shared downregulation of Mrpl20 
and Sept1 and upregulation of [/12b, encoding the p40 subunit 
shared by IL-12 and IL-23—and in which single nucleotide polymor- 
phisms are associated with Crohn's disease in Korean populations”. 
Macrophage subsets shared the downregulation of Lsp1, whereas 
CD103* dendritic cells upregulated Pfkfb3. Lsp1 is involved in activa- 
tion of PPAR) that enhances apoptotic cell phagocytosis and, together 
with Pfkfb3, suppresses diet-induced intestinal inflammation”. 
How allelic polymorphisms within IBD susceptibility loci affect 
genes involved in intestinal tolerance and homeostasis is not known. 
However, modelling the colonic epithelial ulcerations and cellular 
infiltration characteristic of IBD through the treatment of VDTR 
mice with DSS (Extended Data Fig. 10a—c) showed downregulation 
of many of the T;eg-cell activation genes differentially expressed both 
upon apoptotic IEC sampling by migratory eGFP*CD103* MLN den- 
dritic cells, such as Aldh1a2, Atrn, Ccl22, Cd274, Idol, I110 and Relb, 
and relative to those from untreated mice (Extended Data Fig. 10d). 
CD103* dendritic cells were the only phagocytic subset in DSS-treated 
VDTR mice that delivered apoptotic IEC cargo to the MLN (Extended 
Data Fig. 10e), but many Tyg activation genes were no longer upregu- 
lated in these cells within the MLN, with expression levels similar or 
even reduced relative to eGFP” counterparts (Extended Data Fig. 10f). 
Taken together, these data suggest that inflammation alters the outcome 
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Figure 4 | Apoptotic cell cargo endow migratory CD103* dendritic 
cells with the ability to induce T,eg-cell differentiation. a, Hierarchical 
clustering of differentially expressed genes upregulated in CD103* 
dendritic cells as in Fig. 3. b, d-f, Flow cytometry of untreated VDTR 
MLN phagocytes (b) and splenic CD4* T cells (d-f) stained after co- 
culture with indicated ex vivo dendritic cells (d, e) or MLN CD4* T cells 
stained ex vivo (f). Numbers indicate percentage of cells within gates. 

b, Migratory phagocytes pre-gated on live CD45*MHCII"CD11c* cells. 
c, (RT-PCR for indicated genes representative of three independent 
analytical repeats with one biological replicate consisting of two mice. 

e, Geometric mean of CD25 (right panel) from d. Data represent two 
independent experiments, 2-3 replicates per condition. f, MLN Treg 
(pre-gated on CD3*CD4* Helios” ) cells from VDTR mice untreated 

(top) and 3 days after 2ng g' diphtheria toxin treatment (bottom). 

g, Quantification of data from f over time. n= 4 mice per time point; n=5 
mice treated with PBS. One-way ANOVA; *P < 0.05. Data are mean + s.e.m. 


of apoptotic cell sampling, and point to a possible disruption of the 
tolerogenic consequences of apoptotic IEC sampling during IBD. 

Apoptotic cells mediate their known roles in immunosuppression 
by pairing downregulation of inflammatory genes with upregulation 
of negative regulatory nodes that simultaneously target more than one 
pathway of inflammation. Apoptotic IECs also bestow the ability to 
instruct T,eg-cell differentiation upon the single intestinal CD103* 
dendritic cell subset tasked with their sampling. Innate immune recog- 
nition of apoptotic IECs is thus a critical component of the mechanisms 
that mediate gut homeostasis*®. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Mice. Plasmids containing the 9-kb mouse villin promoter (pBS-Villin)*°”” and 
simian diphtheria toxin receptor (HBEGF (‘DTR)) with the enhanced green fluo- 
rescent protein (pDTR-eGFP) fusion gene”* have been previously described. The 
pDTR-eGFP was PCR amplified with primers harbouring a 5’ BsiWI site and 
a 3’ Mlul site and cloned into pBS-Villin. The pBS-Villin/DTR-eGFP plasmid 
was verified by sequencing and the transgene was isolated from the plasmid by 
restriction enzyme digestion and gel purification. The transgene was microinjected 
into fertilized eggs from C57BL/6J mice (Jackson Laboratory) and transferred into 
oviducts of ICR foster mothers as previously described”. Identification of the 
transgenic mice was performed by PCR amplification using the following primers: 
5’-ACTGCTCTCACATGCCTTCT-3’ and 5’-CTTCTTCCCTAGTCCCTTGC-3’. 
For diphtheria toxin administration, mice were injected intraperitoneally with 
2 or 10ng g ' diphtheria toxin (EMD Chemicals) and humanely killed 1-24h 
later’’. Control mice were injected with PBS. For dextran sulphate solution (DSS) 
(MP Biomedicals) studies, mice were supplemented with 3% DSS in the drinking 
water for five days. On day three, water bottles were refilled with 3% DSS solution 
and on day five, replaced with fresh drinking water. Mice were weighed and 
monitored daily for signs of distress, morbidity or mortality during the course of 
the experiment until they were killed on day 7. Both male and female mice ages 6-8 
weeks were used for all studies. All experiments were approved by the institutional 
animal care and use committee and carried out in accordance with the ‘Guide for 
the Care and Use of Laboratory Animals’ (NIH publication 86-23, revised 1985). 
Isolation of professional phagocytes from the SILP. Before isolating professional 
phagocytes (‘phagocytes’), VDTR and VDTR negative littermate controls were 
intraperitoneally injected with PBS (vehicle) or diphtheria toxin (EMD Chemicals) 
at alow (2ng g ') or high (10ng g_') dose per body weight. Mice were then killed 
1-24h later and phagocytes were isolated from the SILP as previously described 
with some modifications”. In brief, the small intestine, including the duodenum, 
jejunum and ileum, was excised and Peyer’s patches removed. Next, the small 
intestine was opened longitudinally with surgical scissors and flushed with ice-cold 
PBS to remove the faecal content. Intestines were then cut into 0.5-cm pieces and 
transferred into 50-ml conical tubes containing 20 ml of PBS. Samples were then 
vigorously shaken for 30s using the vortex genie (Scientific Industries) and passed 
over 100-j1m nylon cell strainers (BD Falcon). Fresh PBS was added to the tissue 
samples and the shaking and filtering process was repeated a total of eight times. 
To isolate and remove the intestinal epithelial cell layer, samples were washed with 
20 ml of warm PBS containing 3 mM EDTA and passed over cell strainers. This was 
repeated three times. Flow-through was kept as purified for IECs, whereas whole 
tissues were further processed to isolate dendritic cell and macrophage subsets. 
Next, samples were washed with ice-cold PBS followed by RPMI 1640 (Sigma) 
containing 5% FBS to remove the EDTA. Samples were then re-suspended with 
RPMI 1640 containing 5% FBS, 1 mg ml“! collagenase D (Roche), and 1mg ml"! 
DNase I (Roche) and incubated in a 37°C water bath for 60 min. Samples were 
shaken every 20 min during this time. At the completion of the incubation, samples 
were washed with FACS buffer to remove the collagenase and then passed through 
an 18-gauge needle followed by a 21-gauge needle to create a single-cell suspension. 
Phagocytes were then enriched from samples by using a 1.065 g ml! OptiPrep 
(Sigma) density gradient according to the manufacturer’s protocol. Following 
centrifugation, phagocytes were isolated from both low- and mid-density bands 
and finally re-suspended in FACS buffer for flow cytometric analyses. 

Flow cytometric analyses. Mouse spleen was digested in parallel with small 
intestine samples and used for single-colour compensation controls. All samples 
were pretreated with Fc block for 10 min at 4°C followed by fluorescently 
conjugated antibody labelling at 4°C for 60 min. The following antibodies were 
used for these studies: Antibodies from BioLegend including Alexa Fluor 647- or 
700-conjugated anti-CD11c (clone N418), PerCP/Cy5.5-conjugated anti-CD24 
(clone M1/69), APC/Cy7-conjugated anti-CD45 (clone 30-F11), APC-conjugated 
anti-CD64 (clone X54-5/7.1), APC-conjugated anti-CD274 (clone 10F.9G2), 
PerCP/Cy5.5-conjugated anti-F4/80 (clone BM8), Alexa Fluor 700-conjugated anti- 
Ly-6c (clone HK1.4), and Phycoerythrin (PE) or Brilliant Violet 421-conjugated 
anti- MHCII I-A/I-E (clone M5/114.15.2); antibodies from eBioscience including 
FITC-conjugated anti-CD4 (clone RM4-5), PE/Cy7-conjugated anti-CD11b 
(clone M1/70) and PE-conjugated anti-CD103 (clone 2E7); and TxRed-conjugated 
anti-CD45 (clone 30-F11) from Invitrogen. Live/Dead Aqua (Life Technologies) 
was used to discriminate viable cells. 

Identification of SILP phagocytes bearing apoptotic IECs. Phagocytes isolated 
from the SILP were surface stained for: APC/Cy7-conjugated anti-CD45, Alexa 
Fluor 700-conjugated anti-CD1 1c, Brilliant Violet 421-conjugated anti- MHCII 
I-A/I-E, PE/Cy7-conjugated anti-CD11b, PE-conjugated anti-CD103, PerCP/ 
Cy5.5-conjugated anti-CD24, and APC-conjugated anti-CD64. The identification 
of phagocytes from VDTR mice with IEC cargo was determined by the presence 
of eGFP and this gate was defined on the basis of C57BL/6J and VDTR™ 


littermate controls that were eGFP~. Sample acquisition was performed using the 
LSRFortessa (BD Biosciences) and data analyses were performed using the FlowJo 
analytical software (Tree Star). 
Cell sorting. To sort phagocytes with and without apoptotic IEC cargo, the 
following surface markers were used: APC/Cy7-conjugated anti-CD45, Alexa 
Fluor 700-conjugated anti-CD11c, Brilliant Violet 421-conjugated anti- MHCII 
I-A/I-E, PE/Cy7-conjugated anti-CD11b, PE-conjugated anti-CD103, PerCP/ 
Cy5.5-conjugated anti-CD24, and APC-conjugated anti-CD64. The identi- 
fication of phagocytes from VDTR mice with IEC cargo was determined by 
the presence of eGFP and this gate was defined on the basis of C57BL/6] and 
VDTR littermate controls that were eGFP. Sorted populations were live, 
CD45*MHCII*CD11c* phagocytes that were either eGFP~ or eGFP* includ- 
ing (i) CD103*CD11b~CD24*CD64~ (hereafter CD103), (ii) CD103*CD11b* 
CD24 CD64* (hereafter CD103 CD11b), and (iii) CD103~ CD11b*CD24— 
CD64* (hereafter CD11b) for a total of six populations. Owing to the four-sample 
sort-maximum of the instrument, the three eGFP* populations were collected 
first and then fresh collection tubes were added for the three eGFP~ populations. 
Cells were sorted directly into 0.5 ml TRIzol LS reagent (Life Technologies) for 
microarray processing (see below). Each sort was performed at 4h following 
diphtheria toxin administration and consisted of 3-4 pooled VDTR mice. The 
following are the cell yield ranges for each subset: 1,000-5,000 eGFP*CD103*; 
3,000-9,000 eGFP*CD103*CD11b*; 10,000-40,000 eGFP*CD11bt; 
4,500-10,000 eGFP-CD103*; 40,000-80,000 eGFP-CD103*CD11b*; and 
30,000-100,000 eGFP~ CD11b*. FACS was conducted on the FACSAria IIu 
SORP (BD Biosciences). The following are the RNA yield ranges for each subset: 
200-2,400 pg eGFP+CD103*; 200-3,000 pg eGFP*CD103*CD11b*; 600-3,000 pg 
eGFP*CD11b*; 200-4,600 pg eGEP~CD103*; 600-5000 pg eGFP-CD103*CD11b*; 
and 450-4,000 pg eGFP”-CD11b*. The purity and identity of each subset was 
validated as indicated in Extended Data Fig. 5 and according to markers as previously 
reported?!. 
Image Stream. For analysis of IEC engulfment by CD11c* phagocytes, single-cell 
suspensions were prepared as described for flow cytometric analyses and acquired 
using the IS 100 Imaging flow cytometer (Amnis Corp). Phagocytes with eGFPT 
cargo were identified as those that contained single nuclei and were CD45", 
CD11ct and MHCII". Data were analysed using IDEAS software (Amnis Corp) 
and spectrally compensated using a compensation matrix generated from the 
following single-colour controls; FITC-conjugated CD4, PE-conjugated MHCII, 
Alexa Fluor647-conjugated CD11c, TxRed-conjugated CD45, and Hoechst stain. 
Real-time qRT-PCR. Total RNA was isolated from mouse small intestine using 
RNeasy mini-kit (Qiagen) and quantified by a spectrophotometer. Reverse 
transcription was performed with Superscript III (Invitrogen) and cDNA was 
synthesized using the Mastercycler ep (Eppendorf). Real-time quantitative 
RT-PCR was conducted in duplicate on a ViiA 7 Real-time PCR System (Life 
Technologies) using TaqMan quantitative PCR Master Mix at a concentration of 
1x (Applied Biosystems) or SYBR Green Real-Time PCR Master Mixes for the 
eGFP and HBEGF (‘DTR) transgenes. Samples were normalized to B-actin and 
relative expression was determined by 2- AAC, method. Forward (FW) and reverse 
(RV) primers for SYBR Green include: 
5’-AGCAAAGACCCCAACGAGAA-3’ eGFP FW, 5'-GGCGGCGGTC 
ACGAA-3! eGFP RV; 5'‘-CACTTTATCCTCCAAGCCACAAG-3’ Hbegf FW, 
5!-CTTTTCCCGTGCTCCTCCTT-3’ Hbegf RV. 

All probe sequences are in the format: 5’ FAM-sequence-BHQ-1 3’ and together 
with forward (FW) and reverse (RV) primer pairs were synthesized by Biosearch 
Technologies. 5’/-AGCCACCCCCACTCCTAAGAGGAGG-3/ Actb probe, 
5!-GAAGTCCCTCACCCTCCCAA-3’ Actb FW, 5’-GGCATGGACGCGACCA-3/ 
Actb RV; 5‘-AAATCGGTGATCCAGGGATTGTTCCA-3’ Acadsb probe, 
5'’-CCTCTGGTTTCCTCTATGGATGA-3’, Acadsb FW, 5'-TCCCT 
CCATATTGTGCTTCAAC-3’ Acadsb RV; 5'-CGGGACAGGGCAA 
CTCTTGCAA-3/ Aldh1la2 probe, 5’-GCTTGCAGACTTGGTGGAA-3’ 
Aldh1a2 FW, 5'-GCTTGCAGGAATGGCTTACC-3’ Aldh1a2 RV; 5’-CCCA 
CTTTCCTTGTGGTACTCTGGAC-3’ Alox5ap probe, 5‘-CAACCAGA 
ACTGCGTAGATGC-3’ AloxS5ap FW, 5'-GAAGGCGGCAGGGACTTG-3’ Alox5ap 
RV; 5‘-TGCCTTTAGTGGCCTCATTGTTCC-3’ Atrn probe, 5’-GGACTCAA 
TCTACGCACCTCTGAT-3’ Atrn EW, 5'-GCCGTCTCATTGCCATCTCTT-3’ 
Atrn RV; 5‘-TTGGCATCAATCTGAGCTGTTGGTG-3’ Axl probe, 
5'-GCCCATCAACTTCGGAAGAAAG-3’ Axl FW, 5’-CCTCTGGC 
ACCTGTGATATTCC-3’ Axl RV; 5‘-AGTGAAGGAGTTCTTCTGGACCTCAA-3’ 
Ccl22 probe, 5‘-CACCCTCTGCCATCACGTT-3’ Ccl22 FW, 5/-ATCT 
CGGTTCTTGACGGTTATCA-3’ Ccl22 RV; 5’-CCACTGCTCATGGATATGT 
TGAACAATAGAGACC-3’ Ccr2 probe, 5‘-AGGGTCACAGGATTAGG 
AAGGTT-3! Ccr2 FW, 5‘-CGTTCTGGGCACCTGATTTAA-3’ Cer2 RV; 5'-CAG 
TGCCCAAGTGGAGGCCTTGATC-3’ Ccr7 probe, 5'‘-CACGCTG 
AGATGCTCACTGG-3’ Ccr7 FW, 5’-ATCTGGGCCACTTGGATGG-3’ 
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Ccr7 RV; 5‘-AGATTCGCTGTCACCAGCACAGACA-3’ Cd40 probe, 5’-TC 
TCAGCCCAGTGGAACA-3’ Cd40 FW, 5'‘-CGGTGCCCTCCTTCTTAACC-3’ 
Cd40 RV; 5‘-CGAATCACGCTGAAAGTCAATGCCC-3’ Cd274 probe, 5’-CG 
GTGGTGCGGACTACAAG-3! Cd274 FW, 5’-CCCTCGGCCTG 
ACATATTAGTTC-3’ Cd274 RV; 5‘-TTCCCAGGGCTTGAGGCTCCC-3/ 
Cd300a probe, 5’-GGCCACCGTGAACATGACTA-3’ Cd300a FW, 5'-GCAG 
GAGAGCTAACACAGACAAC-3/ Cd300a RV; 5‘-ATGGAAAATGG 
GTGGCGTCTAACCCA-3’ Cfh probe, 5‘-CCGAACACTTGGCACTATTGTAA-3’ 
Cfh FW, 5‘-CTCCGGGATGCCCACAAG-3’ Cfh RV; 5’/-CCCTGAAC 
AACCAACAGATGACACTGG-3/ El/f3 probe, 5‘-GGCACTGAAGAC 
TTGGTGTTIG-3’ Elf3 FW, 5'’-CCCTGAACAACCAACAGATGACACTGG-3’ 
EIf3 RV; 5'‘-AGCTGACAGATACACTCCAAGCGGA-3’ Fos probe, 5'-AG 
TGCCGGAATCGGAGGA-3’ Fos FW, 5’-TGCAACGCAGACTTCTCATC-3’ 
Fos RV; 5’-CTGCTCCTGCTGGCTTCCGAGT-3’ Gas6 probe, 
5'-CTGGGCACTGCGCTTCTG-3’ Gas6 FW, 5‘-CGCAACAGCACAGTGTGA-3’ 
Gas6 RV; 5’-TCTTATGCAGACTGTGTCCTGGCA-3’ Ido1 probe, 5’-GGGCC 
TGCCTCCTATTCTG-3’ Idol FW, 5‘-CCCACCAGGAAATGAGAACAGA-3/ 
Ido1 RV; 5'-TCACAAGCAGAGCACAAGCCTGTC-3’ II1b probe, 5’-AAAGAC 
GGCACACCCACCCTGC-3’ I]1b FW, 5’-TGTCCTGACCACTG 
TTGTTTCCCAG-3’ Il1b RV; 5'-TCTGCAAGAGACTTCCATCCAGTTGCCT-3’ 
II6 probe, 5‘-CCAGAAACCGCTATGAAGTTCC-3’ II6 FW, 5’-TCACCA 
GCATCAGTCCCAAG-3’ II6 RV; 5’-TTCAAACAAAGGACCAGCTGGACA-3’ 
1110 probe, 5‘-TCAGCCAGGTGAAGACTTTC-3’ I110 FW, 5’‘-GGCAA 
CCCAAGTAACCCTTA-3’ 1110 RV; 5'-TAACTGGGATCCAGGCACGCC-3’ 
Ly75 probe, 5’-GTCAGACTTCAGGCCACTCAA-3! Ly75 FW, 5’-TGACCCA 
CCAATCACAGGT-3’ Ly75 RV; 5'’-TCCCTTACTTTATTAAGCAG 
CCTGAGAGTG-3’ Mertk probe, 5’-TGATCCCATATACGTGGAAGTTCA-3’ 
Mertk FW, 5‘-CCTGGCAGGTGAGGTTGAAG-3’ Mertk RV; 5'-TTTGC 
GTCTGACTGCCGAGACTC-3’ Muc2 probe, 5’-CCTGGCCT 
CTGTGATTACAAC-3! Muc2 FW, 5'‘-GGTGCACAGCAAATTCCTTGTAG-3’ 
Muc2 RV; 5'‘-TCGCAACCAGATCGGAGATGTGG-3’ Nirc5 probe, 5’-CCA 
GAACTCAGGAAATTTGACTTGA-3’ Nirc5 FW, 5'-TTTGGCAAGA 
TGGCAGCTAA-3’ Nirc5 RV; 5’/-CTGCTGCCTCACTTCTAGCTTCTGC-3’ 
Nirp3 probe, 5‘-GTTGCCTGTTCTTCCAGACT-3’ Nirp3 FW, 5’-GGC 
TCCGGTTGGTGCTTAG-3’ Nirp3 RV; 5‘-TAGGCTGCTTTGGGAA 
TGGCACC-3’ Oasl1 probe, 5‘-CGCGTGCTCAAGGTACTCAAG-3’ Oasl1 FW, 
5'-GACCAGCTCCACGTCTGTAG-3’ Oasl1 RV; 5’-TTGTGATGACTACATG 
GTCACACTCTTC-3’ Plac8 probe, 5’-GAACCCGATACGGCATTCCT-3’ Plac8 
FW, 5‘-TCTTGCCATCCAGCTCCTTAG-3’ Plac8 RV; 5‘-ACCAACACATCG 
GAGCTGCGGA-3’ Relb probe, 5/-GAGCCTGTCTACGACAAGAAGTC-3’ Relb 
FW, 5’-GCCCGCTCTCCTTGTTGATTC-3’ Relb RV; 5’‘-AGTTATGCACGAGT 
GCGAGCTGT-3’ Spred1 probe, 5‘’-CGGCGACTTCTGACAACGATA-3’ 
Spred1 FW, 5'‘-GGTAGCCATCCACCACTTGAG-3’ Spred1 RV; 5'‘-AGAGGTC 
ACCCGCGTGCTAATGGTG-3' Tgfb1 probe, 5’-CCCGAAGCGGACT 
ACTATGC-3! Tgfb1 FW, 5‘-ATAGATGGCGTTGTTGCGGT-3’ Tgfb1 RV; 
5'-CTCTGCCTGCATCCAATCACTCTCA-3’ Timd4 probe, 5'-GGTCCG 
CCTTCACTACAGAATC-3’ Timd4 FW, 5'-GGCCTGAGTACGGCTATGTC-3’ 
Timd4 RV; 5'‘-TGGGCTTTCCGAATTCACTGGAGC-3’ Tnf probe, 5’-ATGC 
ACCACCATCAAGGACTCAA-3! Tnf FW, 5’-ACCACTCTCCCT 
TTGCAGAACTC-3’ Tnf RV; 5‘-TCAACTGGTGTCGTGAAGTCAGGA-3’ 
Tnfaip3 probe, 5’-TCCCTGGAAAGCCAGAAGAAG-3! Tnfaip3 FW, 5'-GA 
GGCAGTTTCCATCACCATTG-3’ Tnfaip3 RV; 5’-TCCGGAGCTAC 
TTCAAGCAAGGC-3’ Vill probe, 5'-GGCAACGAGAGCGAGACTT-3’ Vill FW, 
5’-CGCTGGACATCACAGGAGTT-3’ Vill RW. 

cDNA Microarrays. A total of five sorting experiments with a pool of 3-4 mice 
were performed for the cDNA microarrays. Following cell sorting into TRIzol 
LS reagent, samples were shipped on dry ice to the Center for Functional Genomics 
and the Microarray & HT Sequencing Core Facility at the University at Albany 
(Rensselaer). A sample clean-up step was performed using RNeasy columns 
(Qiagen) that included DNase treatment. The isolated RNA was checked for 
quality using NanoDrop (Thermo Scientific) and Bioanalyzer (Agilent), 
following which 1 ng of total RNA was processed using WT-Ovation Pico RNA 
Amplification System (NuGEN). A total of three biological replicates were used 
for the microarray. When required, RNA was pooled from additional sorts to 
achieve the 1 ng of total RNA needed for the amplification system. The following 
are the sort experiments used for each sample: (2, 2 and 5, 2 and 5) eGFP*CD103*; 
(2, 3, 2 and 5) eGFP*CD103*CD11b*; (2, 4, 5) eGFP+CD1 1b; (3, 2 and 4 and 
5,2 and 4 and 5) eGFP”- CD103°; (2, 3, 4) eGFP- CD103*CD11b’; and (2, 3, 5) 
eGFP-CD11b*. RNA was reverse-transcribed and sense-target cDNAs were 
targeted using the standard NuGEN protocol and hybridized to Affymetrix mouse 
Gene 2.0 ST arrays. These arrays were then washed, stained on a FS 450 station, and 
scanned on a GeneChip 3000 7G scanner using Affymetric GeneChip Command 
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Console Software (AGCC). The Affymetrix microarray data discussed in this 
publication have been deposited in NCBI’s Gene Expression Omnibus (GEO)*” 
and are accessible with the GEO series accession number GSE85682. 

Code availability. Fold changes and statistical significance were identified as those 
genes that were differentially expressed between eGFP* and eGFP~ subsets by 
at least 1.2 fold (ANOVA (Benjamini—-Hochberg false discovery rate correction 
Q<0.05) and Tukey’s HSD post-hoc test (P< 0.05; -1.2> fold >1.2) and deter- 
mined using R software (version 3.2.0). Hierarchical clustering of differentially 
expressed genes meeting the aforementioned criteria were Z-scored and plotted 
with heatmap.2 (gplots version 2.17.0, CRAN/R). Principal component analyses 
of the 1,534 genes (ANOVA (Benjamini-Hochberg false discovery rate correction 
Q<0.05); 4.8% of total) with the most variable expression in each CD11c* subset 
with and without eGFP cargo were generated using R software which are freely 
available online. 

Microscopy. Paraffin sections. Small and large intestine were dissected and 
fixed in 10% formalin (Fisher Scientific) for 24h and then processed for paraffin 
embedding. Tissue blocks were then cut into 5-mm sections, de-paraffinized by 
xylene immersion, and hydrated by serial immersion in 100%, 90%, 80%, 70% 
ethanol and PBS. Antigen retrieval was performed by heating samples in a pressure 
cooker (Cuisinart) in citrate buffer solution (10 mM citric acid monohydrate, 0.05% 
Tween 20 and PBS). Sections were then washed twice in PBS, blocked for 30 min 
in blocking buffer (10% BSA, 0.3% Triton X-100 (Sigma) and TBS), and prepared 
for labelling. TdT-mediated dUTP nick end labelling (TUNEL) was performed 
using the in situ cell-death detection kit, TMR red (Roche), per the manufacturer's 
instructions, stained with DAPI, and mounted using Fluoromount-G (Southern 
Biotech). For cleaved caspase-3 (Cell Signaling), samples were labelled for 60 min 
at room temperature, stained with DAPI, and mounted using Fluoromount-G. For 
paraffin images, eGFP signal was not present owing to sample quenching following 
paraffin embedding and processing. 

Frozen sections. Small and large intestine were dissected and fixed overnight in 
1.6% paraformaldehyde (Thermo Scientific) containing 20% sucrose at 4°C. 
Samples were then placed in OCT (Tissue-Tek) and snap-frozen over dry ice. 
Tissue sections of 8-mm thickness were cut, air-dried and blocked using blocking 
solution. Tissues were then labelled using an Alexa Fluor 594-conjugated phalloidin 
(Invitrogen) or a primary mouse anti-mouse pan-cytokeratin antibody (clone 
PCK-26) (Abcam) for 60 min in a humidified atmosphere followed by a second- 
ary goat anti-mouse Alexa Fluor 594 (Thermo Fisher Scientific) for 30 min, then 
stained with DAPI, and mounted using Fluoromount-G. For fluorescent in situ 
hybridization, small intestine and large intestine were dissected and prepared as 
described for frozen sections**. Following tissue blocking, sections were incubated 
with 0.45 pmol il! eubacterial oligonucleotide probe (AminoC6 + Alexa Fluor 
594) 5/-GCTGCCTCCCGTAGGAGT-3’; (Operon)*? in a pre-chilled hybridization 
buffer (Sigma) overnight at 4°C. Sections were counterstained with DAPI and 
mounted with Fluoromount-G. 

Whole-mount staining. To label small intestine tissues, the whole-mount histology 
protocol was modified from previously described methods™. In brief, small 
intestine samples were excised, opened longitudinally, and washed in ice-cold 
PBS. Samples were then cut to 1 cm in length and placed in 6-ml polypropylene 
tubes (BD Biosciences). Next, samples were incubated with Fc block at 10,1g ml“! 
in 20011 of 2% paraformaldehyde with 1% FBS, 0.3% Triton X-100 in PBS for 
3h at 4°C with gentle rocking. After blocking and fixing, samples were put into 
new polypropylene tubes and labelled using 3 j1g ml! of the following antibodies: 
PE-conjugated anti-CD11c (clone N418) (eBioscience), APC-conjugated 
anti-CD31 (clone 390) (eBioscience) and anti-cleaved caspase-3 at 1:100. All 
labelling was conducted in the dark at 4°C with gentle rocking for 3h. Finally, 
samples were washed for 30 min in the dark at 4°C with fresh PBS and mounted 
for imaging. 

Conventional microscopy was performed using the Eclipse Ni-E motorized 
upright microscope (Nikon) and images were acquired from paraffin, frozen, and 
whole mount tissue sections using a Nikon DS-Qil Mc camera. Cell quantification 
was calculated using NIS Elements imaging software (Nikon) and the object count 
application including intensity of stain thresholds and area restriction filters. 
Confocal microscopy was performed at the Microscope CORE at the Icahn School 
of Medicine at Mount Sinai using the Leica SP5 DM upright microscope and Leica 
LAS AF software. 

In vitro T-cell differentiation. Naive mouse splenic CD4* T cells were isolated 
by sorting with MACS CD4* beads (Miltenyi Biotech) according to the man- 
ufacturer’s instructions and then by FACS using the FACSAria IIu SORP. 
T cells were sorted on the basis of the following criteria: live, CD45*CD3+CD4t 
CD25-CD44~/°"CD62L*/"8", Surface antibodies for sorting included: APC/ 
Cy7-conjugated anti-CD45, eFluor 450-conjugated anti-CD3 (clone 145- 
2c11), PE-conjugated anti-CD4, APC-conjugated anti-CD25 (clone PC61.5), 
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FITC-conjugated CD62L (clone MEL-14), and Alexa 700-conjugated anti-CD44 
(clone IM7) (all eBiosciences). 1 x 10° T cells were then cultured with 1 x 104 
eGFP* or eGFP~ CD103 dendritic cells sorted from the MLN which were identi- 
fied as: live, CD45*MHCII"*CD11ct+, eGEP* or eGEP~, CD103*CD11b~ using the 
aforementioned antibodies for flow cytometry. These cells were cultured in round- 
bottom 96-well plates (Falcon) with complete IMDM (Gibco) supplemented with 
10% FBS, 100 1g ml“! penicillin, 100 pg ml“! streptomycin, 2mM L-glutamine, 
10mM HEPES and 1 nM sodium pyruvate for 5 days. Additionally, all cultures were 
supplemented with 1 jg ml“! of soluble anti-CD3 (clone 2C11) as well as 5ng ml! 
of recombinant human anti-IL-2 (Pepro Tech) on days 2 and 4. A total of 2ng ml! 
of recombinant human anti-TGF$1 (clone 1D11 R&D systems) was added to 
culture wells where indicated on days 1 and 4. On day 5, cells were first surface 
stained with FITC-conjugated anti-CD25, PE/Cy7-conjugated anti-CD4, Alexa 
Fluor 700-conjugated anti-CD3, and APC/Cy7-conjugated anti-CD45, followed 
by fixation and permeabilization (using the concentrate and diluent provided by 
eBioscience), and finally intracellular staining for eFluor 450-conjugated anti- 
FOXP3 (clone FJK-16s), PE-conjugated anti-ROR(t) (clone B2D), PerCP-eFluor 
710-conjugated anti-GATA-3 (clone TWAJ), and APC-conjugated anti-T-bet 
(clone eBio4B10) (all eBioscience). 

Statistical analysis. Data are presented as mean +s.e.m. Statistical significances 
were determined by a one-way ANOVA with Dunnett’s and Newman-Keuls 
post-tests or unpaired two-tailed t-test with Welch correction where specified. 
**P < 0.001, **P<0.01, *P< 0.05. NS, not statistically significant (P > 0.05). 
No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 


Data availability. The Affymetrix microarray data have been deposited in the 
NCBI Gene Expression Omnibus (GEO) under GEO series accession number 
GSE85682. 
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Extended Data Figure 1 | No gross anatomical changes in the small or 
large intestine of VDTR mice. a, The primate diphtheria toxin receptor 
(DTR)-eGFP transgene driven by the mouse villin promoter (mVillin). 
b, qRT-PCR for relative expression of primate diphtheria toxin receptor 
(HBEGF (‘DTR)) mRNA in both the small and large intestine of four 
different DTR-eGFP transgenic (VDTR) founder lines. Founder 32p 
was selected to propagate the VDTR line on the basis of specific 
transgene expression in the intestinal tract and not in other organs. 


Small intestine 


c, d, Haematoxylin and eosin staining (H&E) of small intestine and large 
intestine paraffin sections from treated VDTR (c) and untreated C57BL/6 
(B6) and VDTR (d) as indicated. e-i, Immunofluorescence of small 
intestine (e, f, h) and large intestine (g, i) cryo-sections from indicated 
mice after staining with phalloidin or pan-cytokeratin and DAPI as 
indicated. j, TUNEL (eGFP-quenched after paraffin embedding). p(A), 
polyadenylation. Scale bars, 100 1m (c, d, j), 50 jum (e-i). 
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Extended Data Figure 2 | The intestinal microenvironment does not 
become inflamed following 2 ng g~! diphtheria toxin administration. 
a, Haematoxylin and eosin stain of paraffin-embedded sections from 
small intestine of VDTR mice at 4h following PBS or diphtheria toxin 
administration. b, Flow cytometry of SILP cells from VDTR mice treated 
with PBS or diphtheria toxin. Pre-gated on live CD45*MHCII"* cells. 
Numbers above gates indicate percentage of positively stained cells. 
Quantification is shown in bottom panels. IM, inflammatory monocytes. 
We noted no tissue destruction or discernible inflammation (a), nor 
infiltration of Ly6C™ monocytes* (b) at 4h with either dose of diphtheria 
toxin, in contrast to infiltration of these cells at 16h following 10ng g! 
diphtheria toxin. n =9 mice for PBS, n =6 mice for diphtheria toxin at 
1-4h, and n=3 for PBS and diphtheria toxin at 16 h. One-way ANOVA; 


3% DSS - 5 days PBS - 4hr 


2ngg'DT 10ngg'DT 2ngg'DT 10ngg'DT 


CC3* IECs 

per 150 Villi 
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** P< 0.01; NS, not significant. Flow cytometry gates are representative 
of at least three independent experiments. Data are mean + s.e.m. 

c, d, In situ hybridization with a eubacterial probe on large intestine 
cryo-sections following PBS or diphtheria toxin administration to 
VDTR mice (c), or 3% dextran sodium sulphate (DSS) treatment of B6 
mice for 5 days (d). c, Arrowheads indicate presence of eubacterial probe 
in SILP. There was no bacterial translocation to the intestinal lamina 
propria at 4-h post-treatment with either 2 ng g~! or 10ng g ' diphtheria 
toxin, as evidenced by luminal confinement of the in-situ-hybridized 
eubacterial probe signal, and similar to that in PBS-treated controls. 

e, Immunofluorescence on small-intestine paraffin sections stained with 
antibodies to CC3 at 4h following PBS injection. f, Quantification from 
e and Fig. Ic. Scale bars, 100 1m (a, e), 25 j1m (c, d). 
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Extended Data Figure 3 | Ileum lamina propria CD11c* phagocytes toxin as indicated. b, c, e, eGFP not overlaid. Insets from f and h depicted 
extend dendrites towards apoptotic intestinal epithelial cells. in bottom and right panels, respectively, without eGFP overlay. L, lumen. 
a-h, Conventional (a—e) and confocal (f-h) whole-mount microscopy Arrowhead in a points to a CC3* IEC; in d, toa CD11c* dendrite. Scale 
of excised ileum from B6 or VDTR mice following PBS or diphtheria bars, 251m (a, e), 50j1m (b, d, h), 100j1m (c, f, g). 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | Flow cytometry gating strategy for identifying 
CD11ct+ and CD11c~”” phagocytes in the SILP of VDTR mice. 

a-e, g, Flow cytometric analyses of SILP cells from VDTR mice. Numbers 
indicate the percentage of gated populations. a, Gating strategy for total 
CD11c* phagocytes. b, Identification of the eGFP* gate on the basis of 

the small intestine cellular profile from B6 (eGFP) mice. c, CD103 and 
CD11b expression on gated cells in b. d, Monocytes pre-gated on live 
CD45*+CD11c!°MHCII"CD11b*Ly6C* cells. e, Granulocytes pre-gated on 
live CD45+CD11¢” MHCII-CD11b*Ly6C" cells. f, Percentage of cells and 
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absolute numbers from b. g, Dendritic cells and macrophages pre-gated 
on live CD45*MHCII"CD1 1c" cells, further identified as CD64~ and 
CD64", respectively. Gating on differential CD64 expression, followed 
by delineation of CD103 and CD11b expression, also distinguished two 
macrophage and three dendritic cell populations®. Data represent at least 
three independent experiments and in f, n = 3 B6 mice; n =6 VDTR mice 
treated with 2 ng g ' diphtheria toxin for 1-4h; n=9 VDTR mice treated 
with PBS; one-way ANOVA; *P < 0.05. Data are mean +s.e.m. 

h, Schematic of SILP phagocytes that sample apoptotic IECs (green). 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | eGFP*t and eGFP~ CD11c* phagocyte 
populations exhibit distinct transcriptional profiles. a, FACS-sorted 
dendritic cell and macrophage populations from the SILP of VDTR 

mice at 4h following diphtheria toxin administration using the gating 
strategy described in Fig. 2f, which included CD24 and CD64. b, Principal 
Component Analysis (PCA) of the 1,534 genes (ANOVA Q < 0.05; 4.8% 
of total) with most variable expression in dendritic cell and macrophage 
(M@) subsets, with (eGFP*) or without (eGFP~) apoptotic IEC cargo. 
PCA-separated macrophages from dendritic cells (left panel) and eGFPT 
from eGFP~ (right panel). We confirmed the purity and identity of sorted 
phagocytes on the basis of relative expression of genes encoding the 
molecules used for FACS sorting, as well as macrophage and dendritic- 
cell-specific genes. c-g, Signal intensity of genes encoding MHCII 
(H2-Ab1) and CD1 1c (Itgax), shared by both dendritic cells and 
macrophages, the macrophage-specific Csfl receptor (Csf1r), and the 
dendritic-cell-specific kinase Flt3 (Flt3) in CD11c* phagocytes*! (c); 
molecules used for FACS sorting (d); macrophage markers (e); dendritic 
cell markers (f); and IEC markers from analyses of microarray 
experiments (g). It should be noted that IEC-specific genes were 

not increased in eGFP* compared to eGFP~ phagocytes indicating 
negligible contamination by IEC-specific transcripts. Data represent 

five independent experiments with four mice per experiment and three 
biological replicates. White and black bars indicate expression from eGFP~ 
and eGFP* phagocytes, respectively. h, Validation by qRT-PCR 

for IEC-specific genes including Muc2 (mucin-2) and E/f3 (E74-like 
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factor 3), which were not expressed by phagocytes but readily detectable in 
sorted IECs. Conversely, phagocyte-specific transcripts (Aldh1a2, Siglec1 
and Timd4)*! were expressed in sorted phagocytes but not in IECs. Data 
represent two independent experiments depicting relative expression 
(logs). i, (RT-PCR quantification of relative eGFP expression and the 
primate diphtheria toxin receptor (HBEGF or 'DTR’) from VDTR IECs, 
spleen cells, and eGFP~ and eGFP*t phagocytes. As expected, expression 
of the eGFP and DTR (HBEGF) transgenes were confined to IECs in 
VDTR mice and not present in sorted dendritic cell or macrophages. 
Data represent two independent experiments depicting fold change over 
C57BL/6J litter mate controls. j, RT-PCR relative expression (log.) of 
inflammatory transcripts from indicated eGFP~ SILP macrophages over 
time. The unchanged expression of I11b, I16, and Tnfin eGFP~ CD11bt 
and CD103*CD11bt macrophages, which served as sentinels of the 
microenvironment, provided further evidence for lack of inflammation 
induction by low-dose 2 ng g! diphtheria toxin. Note that the unique 
transcriptional profiles of eGFP* phagocytes may belong to pre-existing 
subpopulations that become detectable only when marked by eGFP, owing 
to an inherently superior capacity for apoptotic cell sampling. However, 
we consider it more likely that the differential gene expression patterns 
between eGFP* and eGFP™ populations are a consequence of apoptotic 
IEC internalization. Data represent three independent analytical repeats 
and one biological replicate consisting of three mice per time point. Data 
are mean +s.e.m. 
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Extended Data Figure 6 | Further validation of the differential gene 
expression profiles following apoptotic IEC sampling. a, Flow cytometry 
validation for MER and CD205 (Ly75) protein upregulation either in 

the steady state or 4h after diphtheria toxin administration, reflecting 
transcript data shown in Fig. 3d. Flow cytometry representative of at 

least three independent experiments. b, g, Hierarchical clustering of 
differentially expressed ‘find me/eat me’ receptor (b) and ‘suppression of 
immune response’ (g) genes at 4h after diphtheria toxin administration. 
Differences in expression did (g) and did not (b) meet statistical 
significance of at least 1.2-fold (ANOVA (Q< 0.05) and Tukey’s HSD 
post-hoc test (P< 0.05); —1.2> fold > 1.2) comparing eGFP* and eGFP~ 
phagocytes at 4h following diphtheria toxin administration; expressed 

by row Z scale and relative expression (log). In b, upregulated genes 
correlating with eGFP content included Cx3cr1 (the receptor for the 

‘find me’ signal fractalkine) and TAM family receptor Axl by CD11b* 
macrophages, as well as Tyro3 by CD103* dendritic cells. Different 
members of the Stabilin family of scavenger receptors, Adgrb1, Stab1/2, 
required for apoptotic cell clearance’, as well as Timd4, which binds to 
exposed phosphatidylserine, were differentially expressed by macrophages 
and CD103* dendritic cells. Data represent five independent experiments 
with four mice per experiment and three biological replicates. Many genes 
in b and g remained differentially regulated at 6h after diphtheria toxin 
administration (see d and e), showing a stable profile not unique to the 
4-h time point. c, (RT-PCR validation for Axl and Timd4 in SILP 
phagocytes at 4h after diphtheria toxin administration. d, e, (RT-PCR 


quantification of relative gene expression in the indicated eGFP~ and 
eGFP* populations (log,). CD11b* macrophages (d) and CD103* 
dendritic cells (e) from the SILP at 6h following diphtheria toxin 
administration. Expression of Ccr7, encoding the chemotactic receptor 
required for migration into mesenteric lymph nodes where priming of 
naive T cells occurs, was higher in SILP eGFP*CD103* dendritic cells, 
but not macrophages, relative to their eGFP™ counterparts (Extended Data 
Fig. 8c), and remained high in eGFP*CD103* dendritic cells at 6h after 
diphtheria toxin administration (e) compared to the downregulation by 
eGFP* cells relative to eGFP~ CD11b* macrophages (d). Data represent 
two independent analytical repeats and one biological replicate consisting 
of three mice. f, Signal intensity of Cd300a, encoding an inhibitory 
receptor reported to bind apoptotic cell-exposed phosphatidylserine and 
suppress commensal-driven IFNB production by large intestine lamina 
propria CD103~CD11b*+CX3CRI1* F4/80!°™ cells’, from microarray 
analyses conducted at 4h after diphtheria toxin administration, showing 
similar levels in SILP eGFP* and eGFP~ macrophages and CD103+ 
dendritic cells indicated on the x axis. Data did not meet statistical 
significance of at least 1.2-fold (ANOVA (Q < 0.05) and Tukey’s HSD 
post-hoc test (P< 0.05); —1.2> fold > 1.2) and represent five independent 
experiments with 4 mice per experiment and three biological replicates. 
Data are mean + s.e.m. White and black bar graphs indicate expression 
from eGFP~ and eGFP* phagocytes, respectively. h, i, Microarray 
validation by duplicate RT-PCR for indicated genes represent three 
independent experiments. 
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Extended Data Figure 7 | Inflammatory cytokine and co-stimulatory phagocytes, respectively. No statistically significant changes were observed 
genes in CD11c* phagocytes are unchanged and decreased at 4h and as genes did not meet statistical significance of at least 1.2 fold (ANOVA 
6h, respectively, following sampling of apoptotic IECs. Signal intensity (Q<0.05) and Tukey’s HSD post-hoc test (P < 0.05); —1.2> fold > 1.2). 
of genes encoding inflammatory cytokines (a), and co-stimulatory Data represent five independent experiments with four mice per 
molecules (b), from microarray analyses conducted 4h after diphtheria experiment and three biological replicates. c, d, RT-PCR quantification 
toxin administration on the SILP phagocyte populations indicated on of Il1b and II6 transcripts expressed by eGFP* and eGFP-CD11bt 
the x axes where CD103CD11b and CD11b denote CD103*CD11b* macrophages (c) and CD103* dendritic cells (d) from the SILP at 6h 
and CD11b* macrophages, and CD103 denotes CD103* dendritic cells. after diphtheria toxin administration. Data represent three independent 
White and black bar graphs indicate expression from eGFP~ and eGFP* analytical repeats and one biological replicate consisting of three mice. 
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Extended Data Figure 8 | Resident phagocytes in the mesenteric lymph 
node do not carry eGFP* apoptotic IEC cargo. a, Duplicate qRT-PCR 
quantification of Aldh1a2 transcript levels. b, Flow cytometry validation 
for CD274 protein expression at steady state or at 4h after diphtheria toxin 
administration reflecting transcript data shown in Fig. 4a. Data represent 
at least three independent experiments. c, 1, Signal intensity of Ccr7 (c) 
and Tgfb1 (1) from microarray analyses. Expression of Ccr7 was higher 

in SILP eGFP*CD103* dendritic cells but not macrophages, relative to 
their eGFP~ counterparts. Although Ccr7 transcripts showed a higher 
expression trend in CD103* dendritic cells, relative expression of Ccr7 
and Tgfb1 did not meet statistical significance of at least 1.2-fold 
(ANOVA (Q < 0.05) and Tukey’s HSD post-hoc test (P < 0.05); 

—1.2> fold > 1.2). Data represent five independent experiments with 
four mice per experiment and three biological replicates. d-g, Flow 
cytometric analyses of MLN cells from naive, untreated B6 and VDTR 
mice. d, e, ‘Migratory’ cells were pre-gated on live CD45*MHCII"CD11ct 
cells. Quantification in right (d) and bottom (e) panels. f, ‘Resident’ 

cells were pre-gated on live CD45*MHCII*CD11c* cells and these cells 
were eGFP. n=3 mice per experiment, n= 6 mice per group; unpaired 
two-tailed t-test with Welch’s correction; ***P < 0.001. e, g,eGFP~ and 
eGFP* cells from d were further sorted on the basis of CD64, CD24, 


LETTER 


CD103 and CD11b expression. e, CD24*CD103* dendritic cells from B6 
control mice did not contain the eGFP label. g, The minor populations 
(~2%) of CD64*CD103+CD11bt and CD64*CD11b* macrophages 
within the migratory gate° were also eGFP . Data ine, g, represent three 
independent experiments. h-k, Flow cytometry analyses of migratory 
eGFP~ and eGFP* phagocytes in the MLN over time following a single 
injection of diphtheria toxin. Increasing the number of apoptotic IECs 
with diphtheria toxin treatment did not lead to a statistically significant 
increase in the frequency or absolute number of the indicated macrophage 
populations (h and i, respectively) or eGFP+CD103* dendritic cells (j and 
k, respectively) over time within migratory MLN cells. This was expected 
given the non-inflammatory nature of the low dose diphtheria toxin and 
the small number (~0.5-2 x 10*) of total CD103* dendritic cell within the 
SILP, of which an even smaller fraction (~10%) are eGFP* (Fig. 2d). Data 
represent four independent experiments with 4 mice per time point. Data 
mean + s.e.m. m, qRT-PCR for Tgfb1 from indicated cell populations. 
Data represent three independent analytical repeats and one biological 
replicate consisting of 3 mice. n, Flow cytometry for surface CD40 
expression on migratory eGFPt and eGFP~ MLN CD103* dendritic 

cells from untreated VDTR mice representative of four independent 
experiments. 
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Extended Data Figure 10 | Inflammation does not alter the population 
of phagocytes that traffics apoptotic IEC cargo to MLN, but does alter 
its gene expression. B6 and VDTR mice for 5 days received drinking 
water supplemented with 3% DSS (yellow highlight in b, d). On day 5, 
DSS treatment was stopped and replaced with fresh drinking water. 

a, c, Paraffin haematoxylin and eosin sections from the large intestine (a), 
and small intestine (c) of indicated mice. Scale bars, 100 1m. b, Change in 
weight shown on the y axis monitored daily as shown on the x axis. DSS 
treatment of VDTR mice (green line) was accompanied by weight loss 
similar to that observed in B6 mice (black line), with minimal damage 
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to the terminal ileum, as expected. n = 4 mice per group. d, f, RT-PCR 
for indicated transcripts expressed by migratory eGFP*CD103* MLN 
dendritic cells from untreated and day 5 DSS-treated VDTR mice (d), and 
eGFP~ and eGFP*CD103* MLN dendritic cell isolated from day 5 DSS 
treated VDTR mice (f). #, not detected. Data represent three independent 
analytical repeats and one biological replicate consisting of two mice. 

e, Flow cytometry analyses of MLN migratory cells from day-5 DSS- 
treated and non-DSS-treated mice. Data represent two independent 
experiments consisting of two mice. Numbers near gates indicate the 
percentage of positively stained cells. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature20141 


Macrophages redirect phagocytosis by non- 
professional phagocytes and influence inflammation 


Claudia Z. Han!, Ignacio J. Juncadella!, Jason M. Kinchen!, Monica W. Buckley!?, Alexander L. Klibanov*“, Kelly Dryden®, 
Suna Onengut-Gumuscu®”, Uta Erdbriigger®, Stephen D. Turner’, Yun M. Shim*, Kenneth S. Tung?! & Kodi S. Ravichandran! 


Professional phagocytes (such as macrophages’) and non- 
professional phagocytes”-° (such as epithelial cells) clear billions of 
apoptotic cells and particles on a daily basis’. Although professional 
and non-professional macrophages reside in proximity in most 
tissues, whether they communicate with each other during cell 
clearance, and how this might affect inflammation, is not known”. 
Here we show that macrophages, through the release of a soluble 
growth factor and microvesicles, alter the type of particles engulfed 
by non-professional phagocytes and influence their inflammatory 
response. During phagocytosis of apoptotic cells or in response to 
inflammation-associated cytokines, macrophages released insulin- 
like growth factor 1 (IGF-1). The binding of IGF-1 to its receptor on 
non-professional phagocytes redirected their phagocytosis, such 
that uptake of larger apoptotic cells was reduced whereas engulfment 
of microvesicles was increased. IGF-1 did not alter engulfment by 
macrophages. Macrophages also released microvesicles, whose 
uptake by epithelial cells was enhanced by IGF-1 and led to decreased 
inflammatory responses by epithelial cells. Consistent with these 
observations, deletion of IGF-1 receptor in airway epithelial cells 
led to exacerbated lung inflammation after allergen exposure. These 
genetic and functional studies reveal that IGF-1- and microvesicle- 
dependent communication between macrophages and epithelial 
cells can critically influence the magnitude of tissue inflammation 
in vivo. 

To address potential cross-regulation between phagocytes, we 
initially tested a panel of eleven soluble mediators that have been linked 
to tissue repair, inflammation dampening, and tissue morphogenesis", 
for their ability to modulate engulfment by LR73 fibroblasts, a non- 
professional phagocytic cell line. Of the eleven factors tested, only 
insulin-like growth factor 1 (IGF-1) significantly dampened apoptotic 
cell uptake, at concentrations reported in mouse and human serum 
(100-600 ng ml!) (Fig. la, b). Other factors such as EGF, FGF2, 
VEGF, PDGF-AA and PDGF-BB did not alter phagocytic activity 
over a range of concentrations (Extended Data Fig. 1a), although each 
elicited early downstream signalling events (Extended Data Fig. 1b-e). 
The engulfment-dampening effect of IGF-1 was also seen with airway 
epithelial cell lines BEAS-2B and 16HBE140-, and the endothelial cell 
line SVEC-40 (Fig. 1c, Extended Data Fig. 1f, g). 

The IGF-1 effect was not due to masking phosphatidylserine (PtdSer) 
on the apoptotic cells (Fig. 1d). LR73 cells express the IGF-1 receptor 
(IGF-1R), and IGF-1 treatment elicited phosphorylation of Akt, a 
signalling molecule downstream of IGF-1R (Fig. 1b). 

In blood, IGF-1 is bound to IGF-binding proteins (IGFBPs) that sta- 
bilize and sequester IGF-1 (ref. 13). Addition of IGFBP3 with IGF-1 
restored the phagocytic capability of LR73 cells (Fig. le), suggesting that 
binding of active IGF-1 to IGF-1R was necessary. A neutralizing anti- 
body against the human IGF-1R reversed the engulfment-dampening 


effect of IGF-1 on BEAS-2B cells (Fig. 1f). Further, OSI-906 (a small 
molecule kinase inhibitor of IGF-1R currently tested in clinical trials) 
and the inhibitor NVP-AEW541 rescued the engulfment capacity of 
LR73 cells treated with IGF-1, with concomitant decrease in phospho- 
rylation of IGF-1R and Erk (Fig. 1g, Extended Data Fig. 2a). IGF-II 
and insulin, which share structural similarity with IGF-1 but have 
lower affinities for IGF-1R'’, could also reduce apoptotic cell uptake 
(Extended Data Fig. 2b, c), albeit at higher concentrations. Thus, pro- 
ductive signalling through the IGF-1R is necessary for IGF-1 to dampen 
apoptotic cell uptake. 

In contrast to the inhibitory effect on apoptotic cell uptake, IGF-1 
enhanced the uptake of PtdSer-containing liposomes of 150-200 nm 
in size (Fig. 1h). This was not seen with EGF or VEGF. IGF-1 enhance- 
ment required IGF-1R signalling, as the inhibitor OSI-906 reduced the 
increased liposome uptake (Fig. 1i). Thus, IGF-1 can redirect phago- 
cytosis by non-professional phagocytes, suppressing uptake of larger 
apoptotic cells and enhancing internalization of smaller particles. 

The IGF-1 effect was reversible, as washing phagocytes that were pre- 
treated with IGF-1 resulted in near-complete restoration of phagocytic 
activity (Fig. 1j). The effect of IGF-1 on phagocytes was also rapid, as 
adding IGF-1 simultaneously with apoptotic cells inhibited engulfment 
in a similar manner as that seen with the pre-treated cells (data not 
shown), which suggests IGF-1 interferes with early step(s) during 
phagocytosis. Blocking certain early signalling pathways downstream 
of IGF-1R, such as Akt, mTOR, Erk or PI3-kinase", did not rescue 
the IGF-1-mediated decrease in apoptotic cell uptake (Extended Data 
Fig. 3a—d). Although IGF-1 can activate RhoA, which is known to 
decrease apoptotic cell engulfment">, inhibition of RhoA-mediated 
signalling did not reverse the suppression of IGF-1-mediated 
engulfment (Extended Data Fig. 3e, f). Overexpressing activated Racl 
(which promotes apoptotic cell engulfment!®'’) bypassed attenuation 
of apoptotic cell uptake (Fig. 1k), suggesting that IGF-1R acts at or 
before a step regulated by active Racl. As apoptotic cell uptake requires 
both Racl-dependent actin polymerization and de-polymerization'®, 
we explored actin regulation via IGF-1. Cytochalasin D (cytoD), which 
promotes actin depolymerization, potently inhibited phagocytosis of 
apoptotic cells? (data not shown). Although cytoD did not affect the 
basal uptake of liposomes by LR73 cells, it blocked the IGF-1-induced 
increase in liposome uptake (Fig. 11). Notably, cytoD-treated LR73 
cells appeared morphologically normal for the duration of the assay. 
Latrunculin A, which promotes actin depolymerization through a 
different mechanism, also reversed IGF-1-mediated enhancement 
of liposome uptake, without affecting the basal uptake of liposomes 
(Fig. 1m). Additionally, Arp2/3 complex can regulate phagocytosis 
through the formation of branched actin networks; however, CK-666, 
a small molecule inhibitor of Arp2/3, had minimal effect on the IGF- 
1-mediated increase of liposome uptake (Extended Data Fig. 3g). 
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Collectively, IGF-1-mediated modulation of phagocytosis involves 
rapid and reversible modification of F-actin/G-actin dynamics, but 
probably not Arp2/3-mediated functions. 

We next tested the effect of IGF-1 on professional phagocytes. Adding 
IGF-1 to the macrophage cell lines J774 or IC-21, or primary mac- 
rophages (bone-marrow-derived or resident peritoneal macrophages), 
did not affect apoptotic cell uptake, even with supra-physiological 
concentrations of IGF-1, at every time point analysed (Fig. In-p, 
Extended Data Fig. 4a, d). Peritoneal macrophages exposed to IGF-1 
also had comparable liposome uptake as control-treated cells (Fig. 1q). 
Importantly, IGF-1R expression was confirmed in all macrophage cell 
types analysed (Extended Data Fig. 4c) and IGF-1 initiated early signal- 
ling in macrophages, as assessed by IGF-1R and Akt phosphorylation 
(Fig. In, Extended Data Fig. 4a). These data suggest that IGF-1- 
mediated modulation of phagocytosis does not extend to macrophages. 

High levels of serum IGF-1 is linked to IGF-1 production in the liver, 
but macrophages can also produce IGF-1 in response to the cytokine 
IL-4 (ref. 19), raising the possibility that within a tissue, IGF-1 release 
from macrophages could regulate nearby non-professional phagocytes. 
We first confirmed IGF-1 secretion by peritoneal macrophages treated 
with IL-4 (Fig. 2a). Furthermore, resident peritoneal macrophages 
exposed to apoptotic Jurkat cells (but not live cells) produced IGF-1 
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Figure 1 | IGF-1 dampens apoptotic cell 
engulfment and enhances liposome uptake 

by non-professional phagocytes. a, Effect of 11 
growth factors on apoptotic cell engulfment by 
LR73 cells (each normalized to vehicle control). 
Thym., thymocytes. b, Left, engulfment by LR73 
cells treated with increasing concentrations of 
IGF-1. Right, immunoblotting for IGF-1R and 
Akt phosphorylation. c, Apoptotic cell uptake by 
BEAS-2B cells treated with human IGF-1. 

d, IGF-1 does not affect Annexin V binding to 
apoptotic thymocytes (n= 6). e, f, Reversal of 
IGF-1-mediated engulfment inhibition of 

LR73 cells by IGBFP3 (e), or BEAS-2B epithelial 
cells by IGF-1R-neutralizing antibody (f). 

g, IGF-1-mediated reduction in engulfment is 
reversed by the IGF-1R inhibitor OSI-906 (left), 
and immunoblotting for IGF-1R and Erk1/2 
phosphorylation (right). h, Liposome uptake by 
LR73 cells treated with either IGF-1 (50ng ml~'), 
EGF or VEGF (100 ng ml“! each). i, Enhanced 
liposome uptake by IGF-1 is reversed by OSI-906. 
j, LR73 cells pretreated with IGF-1 were washed, 
and incubated with apoptotic thymocytes in the 
presence or absence of IGF-1. k, Engulfment by 
LR73 cells transfected with RacG12V treated and 
with mIGF-1. 1, m, Engulfment by LR73 cells 
treated with cytoD (1) or latrunculin A (m) and 
incubated with liposomes with or without IGF-1. 
n, Left, phagocytosis of apoptotic thymocytes by 
J774 macrophage cells treated with IGF-1. Right, 
immunoblotting for IGF-1R expression and Akt 
phosphorylation. 0, p, Apoptotic cell engulfment 
by bone-marrow-derived macrophages (Mo) or 
resident peritoneal macrophages treated with 
IGF-1. q, Liposome uptake by resident peritoneal 
macrophages treated with IGF-1. For a-q, except 
d, n= 3; representative experiment is shown 

and data are mean +s.d. NS, not significant. 

P value of <0.05 (*), <0.01 (**), or <0.001 
(***). See Supplementary Fig. 1 for uncropped 
immunoblots. 
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(Fig. 2a). IGF-1 protein induction appears to be from newly transcribed 
Igfl mRNA (Extended Data Fig. 5a). Macrophage-produced IGF-1 
could also suppress apoptotic cell uptake by LR73 cells, and this was 
reversed by addition of IGFBP3 (Fig. 2b). 

We chose to examine lung tissue to investigate in vivo communication 
between phagocytes, as alveolar macrophages and airway epithelial cells 
reside in close proximity, and both engulf apoptotic cells’. Alveolar macro- 
phages, similar to resident peritoneal macrophages, are derived from 
fetal monocytes”, and are readily isolated through CD11c*Siglec F* 
expression, whereas the airway epithelial cells can be isolated and 
tracked with available genetic tools’. IGF-1R expression was prominent 
in the airway epithelial cells (Extended Data Fig. 5b, c), but was also 
detectable at lower levels in alveolar macrophages. To test the effect 
of IGF-1 in vivo, we used mice in which airway epithelial cells are 
specifically marked by yellow fluorescence protein (YFP). When 
apoptotic cells or liposomes were administered intranasally with IGF-1 
(Fig. 2c), YFP* airway epithelial cells had decreased apoptotic cell 
engulfment, but enhanced liposome uptake (Fig. 2d, e). Importantly, 
lung macrophages from the same IGF-1-treated mice were unaffected 
in their ability to engulf these targets (Fig. 2d, e). Thus, IGF-1 can 
differentially affect engulfment by professional and non-professional 
phagocytes that reside nearby in the same tissue. 
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Figure 2 | Macrophages produce IGF-1 during apoptotic cell clearance. 
a, Left, IGF-1 secretion by peritoneal macrophages stimulated with 

IL-4, apoptotic Jurkat cells or live Jurkat cells. Right, apoptotic or live 
Jurkat cells do not produce IGF-1 (representative of n = 3), mean +s.d. 

b, J774 cells were treated with recombinant IL-4 (rIL-4). After 24h, half 
of the supernatant was assessed for IGF-1 secretion by ELISA, (right 
panel) and the other half was incubated with PBS or IGBP3 and then 
tested in the phagocytosis assay using LR73 cells (right) (representative 
of n=3), mean+s.d. Note, addition of IGFBP3 increases the basal 
phagocytosis owing to low basal levels of IGF-1 found in the supernatant 
of unstimulated J774 cells. c-e, CCSP-Cre mice with YFP*+ Club cells were 
pre-treated with 1 j1g IGF-1 intranasally for 1h and then administered 
targets with or without IGF-1. Uptake of apoptotic thymocytes (d, n =6 
per group) or liposomes (e, n = 3 per group) by alveolar macrophages and 
airway epithelial cells were then assessed. f, Wild-type mice were given 
PBS, IL-4, apoptotic cells, IL-5, or IL-13 intranasally for 2 consecutive 
days, and the BAL fluid assessed for IGF-1 (n= 2-3 mice per group for 
cytokines, n=5, 8 for apoptotic cell instillation). g, LysM-Cre/Igf1 ++ or 
Igft“" mice were given PBS, IL-4 or IL-13, or apoptotic cells intranasally, 
and BAL fluid assessed for IGF-1 (n=6, 6, 4 mice per group for rIL-4; 
n= 6, 4, 4 for rIL-13; n=6, 9, 9 mice per group for apoptotic cell 
instillation). Data are mean +s.e.m. unless otherwise indicated. NS, not 
significant. P value of <0.05 (*), <0.01 (**), or <0.001 (***), 


We next examined IGF-1 release by alveolar macrophages in vivo. 
Intranasal administration of IL-4 or apoptotic Jurkat cells resulted in 
an increased level of IGF-1 in the bronchoalveolar lavage (BAL) fluid 
(Fig. 2f). When we tested IL-13 or IL-5 (cytokines linked to lung 
homeostasis and inflammation!), IL-13 (whose receptor shares a com- 
mon subunit with the IL-4R), but not IL-5, induced IGF-1 production 
(Fig. 2f). As the source of IGF-1 is difficult to distinguish in these 
experiments, we generated LysM-Cre/Igf1/“" mice, which express 
Cre under the control of the endogenous LysM (also known as Lyz2) 
promoter, and target IGF-1 deletion in the myeloid lineage. LysM- 
Cre/Igf1" mice showed loss of Igfl mRNA in alveolar macrophages 
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Figure 3 | Mice lacking IGF-1R in airway epithelial cells have 
exacerbated airway inflammation. a, Schematic of HDM-induced 
allergic airway inflammation. b, Representative images showing IGF-1R 
expression in bronchial epithelial cells, and its loss in CCSP-rtTA/tetO- 
Cre/Igfir™! mice treated with doxycycline. c, Numbers of eosinophils, 
alveolar macrophages, and CD4* T cells in the BAL fluid of CCSP- 
Cre/Igfir*'* and Igflr“" mice administered PBS or HDM (each dot 
represents a mouse). d, Left, representative lung-draining lymph nodes 
from CCSP-Cre/Igfir*/* and CCSP-Cre/Igf1r" mice that were given 
PBS or HDM. Right, total CD4* T-cell counts from lymph nodes. 

e-h, Representative haematoxylin and eosin (H&E) images (e) or periodic- 
acid-Schiff (PAS) staining (g) of lung sections from CCSP-Cre/IgfIr*!* 
and CCSP-Cre/Igf1r" mice given PBS or HDM (n=3-4 mice per 
condition). Representative histological scoring of inflammation (f) and 
PAS staining (h) (n = 6-10 sections and 3 mice per condition). All data 
are presented as mean +s.e.m. NS, not significant. P value of <0.05 (*), 
<0.01 (**), or <0.001 (#**), 


(Extended Data Fig. 5d) and IGF-1 induction after treatment with IL-4, 
IL-13 or apoptotic cells (Fig. 2g). This suggested that the macrophage/ 
myeloid population is the predominant source of inducible IGF-1 in 
the lung. 

To investigate IGF-1-IGF-1R signalling in inflammation, we used a 
model of airway inflammation induced by the allergen house dust mite 
(HDM)”!, where apoptotic cell recognition by airway epithelial cells 
influences inflammation’. We administered HDM intranasally in the 
allergen sensitization and challenge stage, modelling the natural route 
of allergen encounter (Fig. 3a). When we first tested LysM-Cre/Igfl" 
mice, these mice proved unsuitable, as we continued to detect variable 
levels of IGF-1 in the BAL fluid after HDM administration (probably 
owing to leakage of serum IGF-1 during inflammation). Therefore, we 
targeted instead IGF-1R on the epithelial cells. We crossed Igfl#" mice 
with CCSP-rtTA/tetO-Cre transgenic mice (see Methods), the latter 
driving Cre under the Club cell secretory protein (CCSP; also known 
as Scgb1a1) promoter in the epithelial cells of the trachea, bronchi and 
bronchioles. This CCSP-rtTA/tetO-Cre strain allows for inducible 
Cre expression through doxycycline administration through drinking 
water, thereby allowing normal development and gene deletion just 
before allergen exposure. We observed near-complete loss of IGF-1R on 
epithelial cells of CCSP-Cre/IgfIr™" mice after doxycycline treatment 
(Fig. 3b). After sensitization and challenge with low-endotoxin HDM, 
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the CCSP-Cre/Igf1r" mice had greater airway inflammation based on 
several parameters. (1) There was marked increase in eosinophils and 
CD4* T cells in the BAL fluid (Fig. 3c) and trending increase in inflam- 
matory cells in the lungs (Extended Data Fig. 6a). (2) Lung-draining 
lymph nodes were larger and contained more CD4* T cells (Fig. 3d). 
(3) Lung sections showed increased peribronchial and perivascular 
cellular infiltration (Fig. 3e, f) and greater mucus accumulation after 
HDM treatment (Fig. 3g, h). (4) HDM-treated CCSP-Cre/Igf1 rf! mice 
displayed increased airway reactivity after methacholine challenge, 
a measure of the bronchial hyper-responsiveness (Extended Data 
Fig. 6b). There were more apoptotic cells in lung sections (cleaved 
caspase 3 staining) in HDM-treated CCSP-Cre/Igf1r" mice, prob- 
ably owing to greater inflammation (Extended Data Fig. 6c). These 
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Figure 4 | IGF-1R expression in airway epithelial cells is required in 
the sensitization phase of airway inflammation. a, Schematic of IGF-1R 
deletion before the sensitization phase to assess its effect on early stages 
of inflammation. b, Numbers of eosinophils, alveolar macrophages, 
and CD4* T cells on day 6 in the BAL of CCSP-Cre/IgfIr*/+ and CCSP- 
Cre/Igfir™" mice primed with PBS or HDM. c-e, Analysis of IL-4, 
IL-5, eotaxin-1, and IL-6 (via Luminex (c, e), m= 3 mice per group) and 
TSLP (by ELISA (d), 1 =2, 7, 9 mice per group) in the BAL fluid from 
representative CCSP-Cre/Igfir*/* and CCSP-Cre/Igf1r“" mice primed 
with PBS or HDM. f, Schematic of generation and isolation of alveolar- 
macrophage-derived microvesicles. g, h, Representative negative-stain 
EM (g) or cryo-EM (h) images of microvesicles isolated from mouse 
alveolar macrophages. Images show spherical membrane-bound 
structures of a range of sizes (yellow arrows). i, Representative 
ImageStreamX images of microvesicles isolated from mouse alveolar 
macrophage cell line and primary mouse alveolar macrophages and 
stained for alveolar macrophage markers. j, Tunable resistive pulse 
sensing analysis of microvesicles from alveolar macrophages using 
qNano, pore size 400 nm, to determine frequency and sizing of 
microvesicles (representative of n = 3). k, BEAS-2B cells treated 

with IGF-1 (100 ng ml‘) were assessed for uptake of alveolar- 
macrophage-derived microvesicles (MV) (n= 4). 1, BEAS-2B cells 

were treated with HDM either in the presence or absence of alveolar- 
macrophage-derived microvesicles (MV) for 3h and assessed for 
expression of TSLP, CSF2, IL6, IL8 (n= 4). m, Heat map of 10 most 
differentially expressed genes from RNA-seq analysis of BEAS-2B 

cells exposed to HDM with or without alveolar macrophage-derived 
microvesicles. Data presented as mean +s.e.m. ND, not detected, 

NS, not significant. P value of <0.05 (*), <0.01 (**), or <0.001 (***). 


data suggested a requirement for IGF-1R in the airway epithelial cells 
in minimizing inflammation in this model. 

These observations were initially surprising, as we expected the loss 
of IGF-1R on airway epithelial cells to improve apoptotic cell clearance 
and thereby attenuate, rather than worsen, inflammation. This 
prompted us to examine the temporal requirement of IGF-1-IGF-1R 
signalling in epithelial cells during allergen exposure. To distinguish 
the requirement for IGF-1-IGF-1R signalling at the sensitization 
versus challenge phase, we administered doxycycline at different times: 
deleting IGF-1R expression before allergen sensitization (Fig. 4a), or 
after the initial allergen sensitization but before the challenge phase 
(Extended Data Fig. 7a). Mice with IGF-1R deletion before allergen 
sensitization showed a significant increase in inflammatory parameters 
(Fig. 4b), whereas deletion of IGF-1R after the sensitization phase had 
minimal effect on disease severity (Extended Data Fig. 7b, c). 

Airway epithelial cells that encounter allergens can produce 
cytokines, such as TSLP and CSF-2/GM-CSF (that affect dendritic 
cell maturation), as well as IL-33 and IL-25 that drive type-2 innate 
lymphoid cells (ILC2s) to proliferate and produce IL-13 (ref. 21). 
Human bronchial epithelial cells from individuals with asthma produce 
elevated levels of IL-6, IL-8, and CSF-2 (ref. 22), and TSLP released 
from airway epithelial cells (mice and human) can exacerbate airway 
inflammation”. When we assessed the cytokine profile in the BAL fluid 
of CCSP-Cre/Igf1 7“! mice sensitized with HDM, levels of TSLP and 
IL-6 were greater (Fig. 4d, e). We did not find increased levels of IL-33, 
as seen with Rac] deletion in airway epithelial cells*, but this may be 
due to differences in inflammatory pathways regulated by the loss of 
IGF-1R rather than Racl. These results suggest that IGF-1R in airway 
epithelial cells limits inflammatory cytokines during the initial antigen 
exposure/sensitization phase. 

Regarding trigger(s) for IGF-1 release from the alveolar macrophages 
in the sensitization phase, although IL-4 is often considered a classic 
Ty2 cytokine from the adaptive immune response”!, both IL-4 (refs 24, 
25) and IL-13 (ref. 21) can be produced during the allergen sensitization 
phase by resident mast cells, basophils, and/or ILC2s. In fact, IL-4 was 
enhanced (along with IL-5 and eotaxin-1) in the BAL fluid of CCSP- 
Cre/Igf1r" mice sensitized with HDM (Fig. 4c). Thus, IL-4 and/or 
apoptotic cells present during the initial allergen exposure could elicit 
IGF-1 production from alveolar macrophages. 
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Alveolar macrophages are reported to release microvesicles 
containing anti-inflammatory mediators during smoking-induced 
lung injury”®. As IGF-1 enhances liposome uptake by airway epithelial 
cells, we tested whether microvesicles from alveolar macrophages 
may affect airway epithelial cell response to HDM. Using differential 
centrifugation and filtration, we isolated and characterized microve- 
sicles from alveolar macrophages (Fig. 4f). Negative stain and cryo- 
electron microscopy revealed membrane-bound spherical structures 
>100 nm in diameter (Fig. 4g, h). Through ImagestreamX analysis, 
we determined that the particles from an alveolar macrophage cell 
line or primary mouse alveolar macrophages carried markers of lung 
macrophage origin, CD1 1c or Siglec F, with variable Annexin V staining 
(Fig. 4i). Tunable resistive pulse sensing analysis of microvesicles 
revealed a mean particle size of 357 + 148.5 nm (Fig. 4j), within 
the reported 100-1,000 nm range for microvesicles”’. Further, IL-4 
treatment of macrophages increased microvesicles secretion (Extended 
Data Fig. 8a). In addition, uptake of these microvesicles by BEAS-2B 
airway epithelial cells was enhanced by IGF-1 (Fig. 4k). Notably, adding 
alveolar macrophage-derived microvesicles to HDM-treated BEAS-2B 
cells resulted in significantly lower TSLP CSF2, IL6, and IL8 induction 
(Fig. 41). To further explore this, we performed RNA-seq of BEAS-2B 
cells treated with HDM + microvesicles. Whereas HDM treatment 
increased several known genes associated with asthma in humans?®*°, 
such as FGF2, KLF4, IFIT2, and PTX3, adding microvesicles from 
alveolar macrophages suppressed transcription of these genes in the 
epithelial cells (Fig. 4m, Extended Data Fig. 8b). Thus, microvesicles 
released from alveolar macrophages acting on airway epithelial cells 
exposed to allergen could dampen airway inflammation. 

The data presented here provide several insights into phagocytosis 
and tissue inflammation. It has long been known that professional and 
non-professional phagocytes reside in proximity and can engulf dying 
cells and debris, yet communication between these phagocytes was 
not understood. The data presented here identifies a rapid, transient, 
and reversible regulation, wherein soluble IGF-1 from macrophages 
influences the type of particle taken up by epithelial cells. This 
transient effect might allow the macrophages to temporarily ‘redirect’ 
the non-professional phagocytes towards other function(s). Such a 
possible hierarchy in cell clearance could provide temporal and spatial 
cross-communication within a given tissue’. Our data identifies a two- 
part regulation of epithelial cells by macrophages that affects airway 
inflammation, that is, the secretion of IGF-1 redirects particle uptake, 
and the release of microvesicles dampens inflammatory cytokine 
production by epithelial cells (Extended Data Fig. 9). IGF-1 is a growth 
factor widely linked to growth, cellular proliferation, and ageing. Global 
IGF-1 deletion in mice results in dwarfism and perinatal lethality, and 
IGF-1 mutations are linked to human diseases'*. This work identifies 
a previously unappreciated biological function for the IGF-1-IGF-1R 
signalling axis as a modulator of airway hyper-responsiveness to 
allergens with potential therapeutic relevance. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Mice. C57BL/6), IgfIr", and Igft”" mice were obtained from Jackson 
Laboratories. CCSP-rtTA/tetO-Cre were provided by J. Whitsett at Cincinnati 
Children’s Hospital*!. To generate the IGF-1R deletion in Club cells, we crossed 
CCSP-rtTA/tetO-Cre mice to Igf1 "fl (ref. 32). To achieve deletion, mice were 
given doxycycline (1 mg ml’) in drinking water containing 0.4% sucrose for 
at least 7 days before beginning of allergen administration, unless otherwise 
noted. Administering doxycycline, a derivative of tetracycline, binds to reverse 
tetracycline-controlled transactivator (rtTA), which then allows rtTA to bind the 
tetracycline response element, resulting in Cre expression. We also crossed Igft!“" 
mice***4 to LysM-Cre mice to conditionally delete Igf1 in the myeloid lineage. 
We have previously reported on the generation of CCSP-Cre/YFP* mice*. For all 
in vivo experiments, except generation of apoptotic thymocytes, mice between the 
ages of 8 and 12 weeks were used. Mice were allocated to experimental groups on 
the basis of genotype and age-matching. Female and male mice were used for all 
experiments, except for in vivo engulfment assays in which only male mice were 
used. All animal procedures were performed according to the protocols provided 
by the Institutional Animal Care and Use Committee (IACUC) of the University 
of Virginia. No statistical tests were used to predetermine sample size. The exper- 
iments were not randomized and the investigators were not blinded to allocation 
during experiments and outcome assessment. 

Induction of airway inflammation. Mice were given drinking water containing 
doxycycline (1 mg ml~’) seven days before first HDM administration. Mice were 
primed intranasally with 101g of low endotoxin house dust mite extraction (Indoor 
Biotechnologies) on days 0, 2, 4 and then challenged intranasally on days 10, 12, 
and 14. On day 16, mice were collected and analysed for eosinophilic airway 
inflammation. Alternatively, mice were given three doses of low endotoxin HDM 
on days 0, 2, and 4 and analysed on day 6 (‘sensitization phase’). For the ‘challenge 
phase} mice that had not received any doxycycline were given three doses of low 
endotoxin HDM on days 0, 2 and 4, then given doxycycline (in drinking water) 
from day 4 until the mice were analysed. These mice were also challenged intrana- 
sally with three doses of low endotoxin HDM on days 10, 12, and 14 and analysed 
on day 16. 

Collection of BAL fluid, lymph nodes, and lung. For airway inflammation 
experiments, 0.8 ml of PBS were delivered intratracheally through a cannula. 
Recovered BAL fluid was centrifuged and the supernatant was frozen at —80°C 
for subsequent Luminex analysis. Collected cells were stained for surface markers 
to distinguish macrophages, neutrophils, T cells, and eosinophils. For lung 
collection, mice were perfused through the right ventricle with PBS and the lungs 
were carefully excised and placed in type-2 collagenase (Worthington Biochemical 
Corporation) dissolved in HBSS containing Ca** and Mg’*. Lungs were minced 
and then incubated at 37°C for one h, with vigorous pipetting every 15 min to 
separate the tissue. The lung homogenate was then passed through a 70-j1m nylon 
strainer, spun down and treated with red blood cell lysis buffer (Sigma-Aldrich) 
for 5 min. The cells were then washed and resuspended in PBS containing 0.1% 
BSA. Draining lymph nodes were carefully extracted, and a single cell suspension 
was made by passage through a 70-;1m nylon strainer using the flat end of a syringe. 
Cells were washed and then resuspended in PBS containing 0.1% BSA. 

Cell staining and total cell numbers. The collected cells were stained for 
macrophages, neutrophils, T cells, and eosinophils using the following markers: 
CD11c (eBioscience, N418), Siglec F (BD Biosciences, E50-2440), Ly6G 
(eBioscience, 1A8), CD11b (eBioscience, M1/70), F4/80 (eBioscience, BM8), 
CD3 (eBioscience, 145-2C11), CD4 (eBioscience, RM4-5), CD44 (eBioscience, 
IM7), CD69 (eBioscience, H1.2F3). Absolute cell numbers were determined using 
AccuCount Particles (Spherotech). Flow cytometry data was collected on FACS 
Canto I (Becton Dickinson) and analysed with FlowJo (Treestar, Inc.). 
Microscopy and histology. For haematoxylin and eosin and periodic-acid— 
Schiff (PAS) staining of lung sections, mice were perfused with PBS and a cannula 
inserted into the trachea. The lungs were gently inflated with 10% formalin at a 
constant fluid pressure at 25 cm. The trachea was tied off and the entire heart and 
lung were removed and placed in 10% formalin. Lungs were paraffin embedded, 
sectioned and stained by HistoTox Labs (Boulder). Additional lung sections were 
embedded and sectioned by Research Histology Core at University of Virginia 
and the immunohistochemical staining for IGF-1R, and cleaved caspase 3 was 
performed by the University of Virginia Biorepository and Tissue Research Facility. 
Approximately 6-10 images were taken per mice, with a total of 3-4 mice per 
group, and blindly scored by two independent scorers for inflammation, PAS 
staining, and cleaved caspase 3 positive cells. Immunofluorescence staining of 
lung sections was performed at the University of Virginia Cardiovascular Research 
Center Histological Services. 

Airway hypersensitivity. Mice were anaesthetized and given a tracheotomy 
tube that delivered increasing concentrations of aerosolized methacholine. The 
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tracheotomy tube in turn was connected to the inspiratory and expiratory ports of 
a volume-cycled ventilator (flexiVent; SCIREQ Scientific). Airway resistance was 
measured at baseline and after each dose of methacholine*. 

Macrophage isolation. To obtain bone-marrow-derived macrophages, femurs 
were removed from 8-week-old mice and flushed with 5 ml of sterile PBS 
containing 5% FBS. The cell suspension was centrifuged, treated with red blood cell 
lysis buffer, washed, and then plated onto sterile Petri dishes in DMEM containing 
10% L929 media, 10% FBS and 1% penicillin/streptomycin/glutamine. Media 
was replenished every 2 to 3 days and differentiated cells were used at day 6 after 
collection. Resident peritoneal macrophages were obtained by flushing the peri- 
toneal cavity of mice with 10 ml of cold PBS containing 5% FBS. Collected cells 
were spun down, resuspended in X-VIVO 10 (Lonza) and plated at a concentration 
of 3 x 10° cells per well in a 24-well plate for IGF-1 secretion assays, and 5 x 10° 
per well in a 24-well plate for engulfment assays. Floating cells were washed the 
next day and remaining peritoneal macrophages were used 2 days after isolation. 
Alveolar macrophages were isolated by flushing the lungs with 1 ml of cold PBS 
instilled intratracheally (five flushes). Collected cells were centrifuged, resuspended 
in F12K media containing 10% FBS and 1% penicillin/streptomycin/glutamine, 
and seeded at 1 x 10° cells per well in a 48-well plate. All floating cells were 
washed away the next day and remaining cells were used in assays 2 days after 
isolation. 

Microvesicle isolation. MH-S, mouse alveolar macrophages, or primary mouse 
alveolar macrophages were seeded. After adherence, the media was replaced with 
0.22-\1m-filtered media to remove any contaminating microvesicles. After an over- 
night incubation, the supernatant was collected and spun at 5,000g to remove cell 
debris and apoptotic bodies. The pellet was discarded and the resulting supernatant 
was filtered through a 0.8 um filter and spun again at 17,000g. The pelleted microve- 
sicles were then washed with HBSS and then spun again at 17,000g. For engulf- 
ment assays, microvesicles were stained with TAMRA for 20 min and then added 
to BEAS-2B cells for 90 min. For flow cytometry, purified particles were stained 
with CD11c, Siglec FE and Annexin V (BD Biosciences, 550475) and processed on 
ImagestreamX imaging flow cytometer (Amnis). Microvesicle size distribution 
was characterized using qNano (IZON Science) with a NP400 membrane and at 
least 500 particles were counted. Microvesicles were prepared for cryo-electron 
microscopy using standard methods and imaged on an FEI TF20 (ref. 36). 

In vivo cytokine or apoptotic cell administration. Mice were administered intra- 
nasally with 1 1g of recombinant mouse IL-4, IL-5, or IL-13 (eBiosciences), 1 x 10° 
apoptotic Jurkat cells, or PBS as control, for two consecutive days. On the third day, 
BAL fluid was recovered and centrifuged; the supernatant was stored at —80°C for 
subsequent cytokine analysis. 

Cytokine and IGF-1 analysis. IL-4, IL-5, IL-6, and CCL-11/Eotaxin-1 in the BAL 
fluid of mice that were sensitized with HDM (first regimen) were quantified by a 
multiplex Luminex performed by the University of Virginia Flow Cytometry Core 
Facility. Secretion of IGF-1 from J774 cells, peritoneal macrophages, and BAL 
fluids, as well as TSLP from BAL fluids, were measured by ELISA (R&D Systems). 
In vitro cell systems. LR73 (hamster fibroblasts), SVEC-40 (mouse endothelial 
cells), BEAS-2B (human bronchial epithelial cells, ATCC #CRL-9609), 16HBE140- 
cells (human bronchial epithelial cells), MH-S (mouse alveolar macrophage cells 
ATCC #CRL-2019), Jurkat (human T cells) were either available in the laboratory 
or obtained from ATCC, with the latter tested for mycoplasma. 

In vitro engulfment assay. LR73, SVEC-40, BEAS-2B, and 16HBE14o- cells were 
seeded in a 24-well plate. Thymocytes were isolated from 4-6-week-old mice and 
induced to undergo apoptosis with dexamethasone. Thymocytes or prepared 
microvesicles were then stained with either CypHer5E (GE Healthcare, PA15401) 
or TAMRA (Invitrogen, C-1171). LR73 and SVEC-40 cells were incubated 
with apoptotic thymocytes at a 1:10 phagocyte to target ratio, BEAS-2B at a 1:5 
phagocyte to target ratio, and 16HBE14o- at a 1:20 phagocyte to target ratio for 
2h. Mouse IGF-1 (Sigma-Aldrich), human IGF-1 (Sigma-Aldrich), human IGF-II 
(Sigma-Aldrich) were added to the phagocytes at the same time as addition of 
apoptotic targets. For IGFBP3 studies, IGFBP3 (Sigma-Aldrich) was added to 
media or supernatant from J774 cells for one h to allow IGFBP3 to bind to any 
available IGF-1, then the mix is added to phagocytes along with apoptotic targets. 
For all pharmacological studies, phagocytes were pre-incubated with the 
compounds listed below for one h before addition of apoptotic targets: cytoch- 
alasin D®” (Sigma-Aldrich, C8273, 1,.M), latrunculin A** (Tocris, 3973, 150nM), 
CK-666 (ref. 39) (Tocris, 3950, 25-100 1M), OSI-906 (ref. 40) (Selleckchem, $1091, 
5-40nM), NVP-AEW541 (ref. 41) (Selleckchem, $1034, 25-100 nM), Rapamycin 
(Sigma-Aldrich, RO395, 10 1M-1mM), MK-2206 2HCI (Selleckchem, $1078, 
10nM-1\1.M), U0126-EtOH (Selleckchem, $1102, 8 pM-5 nM), Wortmannin 
(Sigma-Aldrich, W1628, 50-200 nM), Y27632 (Calbiochem, 688000, 3.75-15,1M), 
or GSK269962 (Tocris, 4009, 80 nM-2|1M). Phagocytes were examined to ensure 
no gross morphological changes occurred owing to drug treatment. Targets were 
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then washed off three times with PBS, and the cells were dissociated from the plate 
with trypsin and the engulfment assessed by flow cytometry. 

In vivo engulfment assay. CCSP-Cre/YFP* mice were administered PBS or 
1g of IGF-1 intranasally. One h later, 100 million CypHer5E-labelled apoptotic 
thymocytes with or without IGF-1 were injected intranasally for 3.5h. The BAL 
fluid was collected and the lungs excised, minced, and digested into a single-cell 
suspension. The cells were then stained with appropriate markers: CD11c and 
Siglec F for alveolar macrophage markers and EpCam (along with YFP expression) 
for airway epithelial cells and analysed by flow cytometry to assess apoptotic cell 
uptake by the airway epithelial cells and the alveolar macrophages. 

Liposome construction. Liposomes were prepared by dissolving the lipids 
(phosphatidylserine, dioleoyl phosphatidylcholine, cholesterol and the lipid DiD 
dye) in chloroform, evaporating chloroform under flow of argon gas in a glass 
vial, then subjecting the lipid layer to overnight lyophilization to remove traces 
of organic solvent. Then normal saline was added for hydration, and intense 
vortexing was performed to prepare multilamellar vesicles (MLV). Liposomes 
were repeatedly filtered through a 0.2-\1m Nuclepore polycarbonate filter to 
prepare smaller particles. Particle size was verified by dynamic light scattering 
using Nicomp 370. 

Immunoblotting. LR73, J774 or BEAS-2B cells were seeded in a 60-mm dish at a 
concentration of 5 x 10°. Cells were serum-starved for 6h and then stimulated with 
100ng ml! of IGF-1 for various time points. Cells were lysed in RIPA buffer and 
used in western blots. The blots were probed for phospho-Erk1/2 (Cell Signaling 
Technology, 4370), phospho-Akt (Cell Signaling Technology, 4060), phospho-IGF-1R 
(Cell Signaling, 3024), total Erk2 (Santa Cruz Biotechnology, sc-154-G), total Akt 
(Cell Signaling Technology, 4691), total IGF-1R (Cell Signaling, 9750), and anti-B- 
actin-HRP (Sigma-Aldrich, A3854) followed by chemiluminescence detection. 
Quantitative RT-PCR. Total RNA was extracted from cells using Quick-RNA 
Miniprep Kit (Zymo Research) or RNeasy Mini Kit (Qiagen) and cDNA was 
synthesized using QuantiTect Reverse Transcription Kit (Qiagen) according to 
manufacturers’ instructions. Quantitative gene expression for mouse Igf1, human 
TSLP, CSF2, IL6, IL8 or housekeeping human or mouse Hprt was performed using 
Taqman probes (Applied Biosystems) using StepOnePlus Real Time PCR System 
(ABI). 

RNA-seq. BEAS-2B cells were treated with HDM and alveolar macrophage- 
derived microvesicles for 3h. Total RNA was extracted and an mRNA library was 
prepared using Illumina TruSeq platform and followed by transcriptome 
sequencing using an Illumina NextSeq 500 cartridge. Four independent 
experiments were sequenced. Rv3.2.2 was used for graphical and statistical 
analysis and the R package DESeq2 was used for differential gene expression 
analysis of RNA-seq data. RNA-seq analysis was performed by the UVa 
Bioinformatics Core. 


Code availability. R code used for bioinformatics analysis and heat map generation 
is available upon request. 

Data availability. RNA-seq data have been deposited in the Gene Expression 
Omnibus under accession code GSE89049. 

Statistical analysis. Statistical significance was determined using GraphPad 
Prism 5 or 6 using unpaired Student's two-tailed t-test, one-sample t-test, one- 
way ANOVA or two-way ANOVA, according to test requirements. Variance was 
similar between groups. No inclusion/exclusion criteria were pre-established. 
Grubbs’ Outlier Test was used to determine outliers, which were excluded from 
final analysis. A P value of <0.05 (indicated by one asterisk), <0.01 (indicated 
by two asterisks), or <0.001 (indicated by three asterisks) were considered 
significant. 
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Extended Data Figure 1 | IGF-1, but not EGF, VEGF, PDFG-AA/-BB, specified growth factors for indicated time and the phosphorylation of 
suppresses phagocytosis of apoptotic cells in non-professional Erk1/2 was determined by immunoblotting (n= 2). f, g, Representative 
phagocytes. a, Representative engulfment assay in which LR73 cells engulfment assay in which the uptake of apoptotic thymocytes by 
were treated with indicated growth factors at increasing concentrations 16HBE140- human airway epithelial cells (f) or SVEC-40 endothelial cells 
and assessed for engulfment of apoptotic thymocytes (n= 3). (g) is dampened by IGF-1 treatment (m= 3). Error bars represent s.d., 
b-e, Serum-starved LR73 cells were stimulated with 100 ng ml! of P value of <0.001 (***). 
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Extended Data Figure 2 | Insulin and IGF-II also decrease apoptotic 
cell engulfment, similar to IGF-1 that is reversed by treatment with 
NVP-AEW541. a, Left, engulfment of apoptotic thymocytes by LR73 cells 
treated with various doses of NVP-AEW541, a small molecule inhibitor of 
IGF-1R (n=3). Right, representative immunoblot of LR73 cells stimulated 
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b, c, Engulfment of apoptotic thymocytes by LR73 cells treated with the 
indicated concentrations of human insulin (b) and human IGF-II (c) 
(n=2). Error bars represent s.d. 
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Extended Data Figure 3 | Blocking canonical signalling intermediates 
downstream of IGF-1 receptor signalling, Rho-kinase- or Arp2/3- 
mediated functions, does not reverse the IGF-1 mediated engulfment 
modulation. a—f, Engulfment of apoptotic thymocytes by LR73 cells 
treated with U2016 (Erk1/2 inhibitor) (a), MK-2206 (Akt1/2/3 
inhibitor) (b), Rapamycin (mTOR inhibitor) (c), or Wortmannin 

(PI 3-Kinase inhibitor) (d), Rho-kinase inhibitors Y27632 (e) or 
GSK269962 (f) in the presence or absence of IGF-1 (n = 2-3). Initially, it 
appeared that inhibition of Rho-kinase was able to partially rescue IGF-1 
induced engulfment suppression. However, as Rho-kinase inhibition 
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basally increases phagocytosis of apoptotic cells (consistent with what has 
been previously reported), we normalized the change in phagocytosis for 
each inhibitor concentration to the appropriate control (right panel). After 
normalizing, we observed that Rho-kinase inhibition did not increase 
apoptotic cell uptake in LR73 cells in the presence of IGF-1 more than the 
increase observed basally owing to Rho-kinase inhibition. Thus, inhibition 
of Rho-kinase does not appear to rescue IGF-1-induced engulfment 
suppression. g, LR73 cells were treated with CK-666 and then assessed 

for uptake of liposomes in the presence of IGF-1. Data represented as 
mean + s.d. 
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Extended Data Figure 4 | Macrophages express IGF-1R and 
phosphorylate IGF-1R upon IGF-1 stimulation, but engulf apoptotic 
cells at normal capacity when exposed to IGF-1 or insulin. a, b, J774 
cells (a) or LR73 cells (b) treated with 100 ng ml! mouse IGF-1 were 
assessed for their ability to engulf apoptotic thymocytes, or serum-starved 
for 6h and stimulated with 100 ng ml~! mouse IGF-1 and assessed for 


phosphorylation of IGF-1R by western blot. c, Flow cytometry histograms 
of IGF-1R expression on J774 cells (left), bone-marrow-derived 
macrophages (middle), and peritoneal macrophages (right) (n = 3-4). 

d, e, IC-21 cells treated with indicated concentrations of mouse IGF-1 (d) 
or human insulin (e) were assessed for their ability to engulf apoptotic 
thymocytes (m= 2-3). Error bars represent s.d. 
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Extended Data Figure 5 | Production of IGF-1 by peritoneal b, c, Lung sections from wild-type mice were stained with antibodies 
macrophages after apoptotic cell or IL-4 stimulation correlates with against alveolar macrophages (Mac-2), airway epithelial cells (CC-10), 
new transcription. a, Peritoneal macrophages were either untreated, and IGF-1R. d, Alveolar macrophages isolated from LysM-Cre/Igft“" and 
stimulated with rIL-4 or apoptotic Jurkat cells and Igfl mRNA (top panels) _littermate controls were assessed for Igfl mRNA expression (n =2 per 
and IGF-1 protein in the supernatant (bottom panels) were assessed in a group, data represented as mean + s.e.m.). 


time course (n.d., not detected) (n = 3). Data represented as mean + s.d. 
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immune cell infiltration in the lungs and more apoptotic cells. c, Representative histology images of cleaved caspase (CC3) staining 
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Extended Data Figure 9 | Model for alveolar macrophage regulation of 
airway epithelial cells, with respect to particle uptake and the response 
to allergens, through IGF-1 and microvesicles. Exposure of airways to 
allergens, such as HDM, can cause apoptotic cell death as well as IL-4 

and IL-13 production, from mast cells and type-2 innate lymphoid cells 
(ILC2s). These cytokines, along with apoptotic cells, trigger alveolar 


a BS IL-4/13 


) (from mast cells, 


i : basophils, ILC2 cells?) 
b Microvesicles 


macrophages to produce IGF-1. The released IGF-1 (a) then acts on the 
airway epithelium to elicit two actions: first, to decrease the uptake of 
apoptotic cells, and second, to enhance the uptake of macrophage-derived 
microvesicles. These microvesicles (b) dampen inflammatory cytokine 
production through airway epithelial cells. 
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Control of mitochondrial function and cell growth 
by the atypical cadherin Fatl 


Longyue L. Cao!*, Dario F. Riascos-Bernal!**, Prameladevi Chinnasamy!”, Charlene M. Dunaway!”, Rong Hou!?+, 
Mario A. Pujato*+, Brian P. O’Rourke’, Veronika Miskolci’, Liang Guo®, Louis Hodgson’, Andras Fiser?® & Nicholas E. S. Sibinga!* 


Mitochondrial products such as ATP, reactive oxygen species, and 
aspartate are key regulators of cellular metabolism and growth. 
Abnormal mitochondrial function compromises integrated 
growth-related processes such as development and tissue repair”, 
as well as homeostatic mechanisms that counteract ageing and 
neurodegeneration’, cardiovascular disease*’, and cancer®’. 
Physiologic mechanisms that control mitochondrial activity in such 
settings remain incompletely understood. Here we show that the 
atypical Fat1 cadherin acts as a molecular ‘brake’ on mitochondrial 
respiration that regulates vascular smooth muscle cell (SMC) 
proliferation after arterial injury. Fragments of Fat1 accumulate in 
SMC mitochondria, and the Fat1 intracellular domain interacts with 
multiple mitochondrial proteins, including critical factors associated 
with the inner mitochondrial membrane. SMCs lacking Fat1 
(Fat1*°) grow faster, consume more oxygen for ATP production, 
and contain more aspartate. Notably, expression in Fat1®° cells of 
a modified Fat1 intracellular domain that localizes exclusively to 
mitochondria largely normalizes oxygen consumption, and the 
growth advantage of these cells can be suppressed by inhibition 
of mitochondrial respiration, which suggest that a Fatl1-mediated 
growth control mechanism is intrinsic to mitochondria. Consistent 
with this idea, Fat1 species associate with multiple respiratory 
complexes, and Fat1 deletion both increases the activity of complexes 
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I and II and promotes the formation of complex-I-containing 
supercomplexes. In vivo, Fat1 is expressed in injured human and 
mouse arteries, and inactivation of SMC Fat1 in mice potentiates 
the response to vascular damage, with markedly increased medial 
hyperplasia and neointimal growth, and evidence of higher SMC 
mitochondrial respiration. These studies suggest that Fat1 controls 
mitochondrial activity to restrain cell growth during the reparative, 
proliferative state induced by vascular injury. Given recent reports 
linking Fat1 to cancer, abnormal kidney and muscle development, 
and neuropsychiatric disease*"'’, this Fat1 function may have 
importance in other settings of altered cell growth and metabolism. 

Proliferation of SMCs is essential for repair of injured arteries, but in 
excess can cause vascular obstruction leading to tissue ischaemia and/or 
infarction'*. SMC expression of the atypical cadherin Fat] is induced by 
both growth factor exposure and vascular injury; however, knockdown 
of Fatl enhances SMC proliferation, and the Fat1 intracellular domain 
(Fatl ICD), expressed as a chimaeric protein fused to the interleukin 
2 receptor a-chain (denoted hereafter as Fat1-IL-2R), is sufficient to 
inhibit growth”. In addition, recent reports link FAT! silencing and/or 
mutation to a spectrum of cancers*’. To understand how Fat! controls 
proliferation, we used tandem affinity purification (TAP) followed by 
peptide mass fingerprinting using matrix-assisted laser desorption/ 
ionization time-of-flight mass spectrometry (TAP-MS) to identify 


Figure 1 | Fat fragments localize to SMC 
mitochondria and interact with inner 
mitochondrial membrane proteins. a, Analysis 
of 30 top-ranked TAP-MS-validated Fatl ICD 

fi interactors. IMM and OMM, inner and outer 
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Figure 2 | Fat1 suppresses SMC growth by restraining mitochondrial 
respiration. a, Growth of mouse aortic SMCs. n= 6. b, OCR of SMCs 
at baseline and in response to oligomycin (1), carbonyl cyanide-p- 
trifluoromethoxyphenylhydrazone (FCCP, an uncoupling agent that 
disrupts ATP synthesis) (2), and rotenone (3). m= 10. c, SMC aspartate 
content. RU, relative units. n =7, analysed by two-tailed t-test. d, SMC 
growth in response to 11M rotenone (rot) or vehicle (veh). n =3. Data 
assessed by two-way ANOVA (a, b, d). All data shown as mean + s.e.m. 


proteins that interact with the Fatl ICD. We noted that a surprisingly 
large fraction of candidate interactors associated with mitochondria, 
including several proteins that localize to the inner mitochondrial 
membrane (Fig. 1a, Extended Data Table 1); these observations were 
replicated in an unbiased independent bioinformatic analysis of the 
TAP-MS dataset (Fig. 1b, Extended Data Fig. 1a). 

We used a conditionally targeted mouse Fat] allele (Fat and 
Cre-mediated recombination to inactivate Fatl expression in SMCs 
(Extended Data Fig. 1b). Cellular fractionation showed Fat] fragments 
in the mitochondrial sample: immunoblotting using a Fat] C-terminus- 
directed antiserum’? identified multiple specific signals between 
~60-100 kDa, including an ~60kDa band seen only in whole-cell and 
mitochondrial fractions (Fig. 1c). Confocal imaging of these cells showed 
Fat1 and mitochondrial co-localization (Fig. 1d). Identification of 
multiple inner mitochondrial membrane proteins in our screen (Fig. 1a, 
Extended Data Fig. la, Extended Data Table 1) suggests that Fatl 
may associate with respiratory complexes; indeed, traditional co- 
immunoprecipitation studies confirmed interaction of the Fat] 
ICD with NDUFS3, a complex-I-specific subunit (Fig. le), and with 
prohibitin, an important regulator of complex I stability and function'® 
(Extended Data Fig. Ic). 

We then assessed growth and mitochondrial function. Resazurin 
(AlamarBlue) growth/viability assays were validated across a range of 
input cell numbers, and performed similarly for both wild-type and 
Fat1*° cells (Extended Data Fig. 2a). Evidence of higher proliferation 
in SMCs lacking Fat1 (Fig. 2a, Extended Data Fig. 2b) was normalized 
by restoring Fat1 expression with Fat1-IL-2R (Extended Data Fig. 2b). 
In assays of mitochondrial respiration using extracellular flux 
(Seahorse) analysis, Fatl KO SMCs exhibited higher baseline and 
maximal oxygen consumption rates (OCRs) compared to wild type 
(Fig. 2b, Extended Data Fig. 2c), with equal coupling efficiency 
(Extended Data Fig. 2d). Coupling efficiency reflects how effectively 
respiratory activity is translated into ATP production", so higher ATP- 
linked oxygen consumption in Fat1° SMCs indicates that these cells 
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generate more ATP. Net ATP levels in wild-type and Fat1<° SMCs were 
similar (Extended Data Fig. 2e), which in turn suggests that consump- 
tion of ATP in Fat1<° SMCs is also increased. Reactive oxygen spe- 
cies, which also result from mitochondrial respiration, were present at 
similar levels under standard conditions, but were higher in Fat 1*° cells 
after exposure to HO (Extended Data Fig. 2f), suggesting a partially 
depleted capacity to buffer or inactivate reactive oxygen species when 
Fat is absent. Aspartate, a metabolite essential for macromolecular syn- 
thesis and cell proliferation, and a third key product of respiration!*””, 
was present at higher levels in Fat18° SMCs (Fig. 2c). Mitochondrial 
respiration is essential for cell growth'®*", including that of SMCs”, 
so we tested its role in the increased growth of Fat1*° cells. Limiting 
respiration by either pharmacologic or genetic interference targeting 
complex I did not compromise wild-type cell growth, but suppressed 
the growth advantage of Fat1®° cells (Fig. 2d, Extended Data Fig. 3a-c). 
These studies suggest that physiologic Fatl expression modulates SMC 
growth by restraining mitochondrial respiration, and that further limits 
on respiration in wild-type cells have no repercussion. By contrast, 
when Fat] is absent, the corresponding loss of this restraint increases 
SMC growth. Overall, these in vitro studies identify Fat] as an impor- 
tant suppressor of SMC growth, and show that potentiation of growth 
in the absence of Fat1 depends on increased respiration. 

Respiration can be affected by changes in mitochondrial structure, 
mass, and/or dynamics, but mitochondria from wild-type and Fat1 KO 
SMCs did not exhibit major ultrastructural differences by electron 
microscopy (Extended Data Fig. 4a-f). Likewise, we found no evidence 
for changes in (1) mitochondrial mass, on the basis of the levels of 
several respiratory chain subunits (Extended Data Fig. 4g), (2) mito- 
chondrial biogenesis and autophagy, on the basis of expression of 
regulators of these respective processes”! (Extended Data Fig. 4h-j), 
or (3) mitochondrial dynamics, as reflected in fusion or fission gene 
expression (Extended Data Fig. 4h, k). Failing to identify differences 
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Figure 3 | The Fat1 ICD limits complex I and II activities, and complex- 
I-containing supercomplex formation in SMCs. a, OCR after introducing 
Fat1-IL-2R or mitochondria-targeted Fatl ICD (Fat1™"°) into Fat1 a 
SMCs. (1), oligomycin; (2), FCCP; (3), rotenone; EV, empty vector. n= 15, 
analysed by one-way ANOVA. b, NADH oxidation by immunocaptured 
complex I from SMC lysates, expressed as fold-increase from wild 

type. Wild type n= 13, Fat1X° n=11.c, Ubiquinol production by 
immunocaptured complex II. n= 16. d, SDS-PAGE and immunoblotting 
of proteins eluted from immunocaptured complex I. e, Native complex 

I (CI) and supercomplexes (SC) containing complex I in mouse SMC 
mitochondria by blue native (BN)-PAGE analysis. f, Quantification of 

SC to CI ratio from BN-PAGE analyses. n = 5. RU, relative units. Data 
analysed by two-tailed t-test (b, c, f). All data shown as mean + s.e.m. For 
gel source data, see Supplementary Fig. 1. 
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Figure 4 | Fatl restrains SMC growth and mitochondrial respiration 
after vascular injury. a, Haematoxylin and eosin (H&E) stain and FAT1 
and ACTA2 immunohistochemistry in restenotic human coronary 
arteries. Black squares are magnified in lower panels. Arrowheads, regions 
of co-expression. Scale bars, 1 mm (upper panels) and 100 1m (lower 
panels). b, Human SMC confocal imaging. Scale bar, 101m. c, H&E- 
stained mouse carotid arteries. Arrowheads, medial hyperplasia. Arrows, 
internal elastic lamina. L, lumen. Scale bar, 100|1m. d, Hyperplasia as 


in mitochondrial structure, mass, or turnover that could explain the 
increased respiration in Fat1 KO SMCs, we surmised that loss of Fatl 
might directly affect respiratory chain activity. 

To assess this possibility, we tested if the Fat] ICD was sufficient to 
control respiration. Expression of Fat1-IL-2R—which inhibits SMC 
growth’—limited respiration in Fat!X° SMCs (Fig. 3a), and yielded 
fragments in the mitochondrial fraction similar in size to endogenous 
mitochondrial Fat1 species (Extended Data Fig. 5a), but this fusion 
protein may also localize to other subcellular compartments. We 
generated and validated a mitochondria-targeted Fatl ICD (Fat1™", 
Extended Data Fig. 5b-d); Fat1 mito repressed both baseline and 
maximal OCRs in Fat1*° cells (Fig. 3a, Extended Data Fig. 5e, f), 
demonstrating that the Fat! ICD can suppress respiration through 
a mechanism intrinsic to mitochondria. Although expression of the 
Fatl-IL-2R fusion protein appeared stronger than that of Fat1™"° 
(Extended Data Fig. 5a, d), the extent of OCR repression achieved was 
similar, perhaps indicating that only a subset of the Fat1-IL-2R-derived 
fragments are active in this capacity. To localize this effect within the 
respiratory chain, we assessed the activities of isolated respiratory 
complexes. Complexes I and II, but not IV or V, were more active in the 
absence of Fat1 (Fig. 3b, c, Extended Data Fig. 6a, b). Moreover, Fatl ICD 
species of ~60-100 kDa were observed in association with immuno- 
captured complex I, II and V, but not complex IV (Fig. 3d, Extended 
Data Fig. 6c). To see if Fatl affects respiratory complex integrity, we 
looked at intact mitochondrial complexes by blue native (BN)-PAGE 
analysis and found that in Fat1X° SMCs, complex I incorporation 
into supercomplexes is increased, without a significant change in the 
levels of native complex I (Fig. 3e, f, Extended Data Fig. 6d). It has been 
proposed that supercomplex conformation facilitates electron transfer 
without increasing production of reactive oxygen species”*»”*; however, 
the factors regulating supercomplex assembly are largely unknown. 
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h, OCR of SMCs from ligated arteries, 3 days after injury. i, Basal 
respiration from h. n =7, analysed by two-tailed t-test. Data analysed by 
two-tailed Mann-Whitney test (d-g). All data shown as mean + s.e.m. 


Orthogonal resolution of BN gels under denaturing conditions indi- 
cated a shift of complex I subunits towards supercomplex confor- 
mations in Fat1-deficient SMCs and no major differences in native 
complex II (Extended Data Figs 6e, 7). We observed ~60-100-kDa 
Fatl species co-migrating with complex I, IJ, III, and V, but not 
complex IV. Notably, a subset of these species also co-migrated with a 
supercomplex containing at least complexes I and III (Extended Data 
Fig. 6e). These findings suggest that Fat] limits SMC mitochondrial 
respiration by repressing complex I and II activities, and by opposing 
the incorporation of complex I into supercomplex structures. 

To assess Fatl relevance to clinical vascular disease, we examined 
segments of diseased post-mortem human arteries adjacent to sites of 
prior stent placement. FAT1 expression coincided in many areas with 
ACTA2, a marker of SMCs (Fig. 4a). As it did in mouse SMCs, FAT 1 in 
cultured human aortic SMCs (HASMCs) co-localized with mito- 
chondria (Fig. 4b). Moreover, knockdown of FAT1 in HASMCs 
increased both cell proliferation (Extended Data Fig. 8a—c) and basal 
and maximal mitochondrial respiration (Extended Data Fig. 8d, e). 
Thus the effects of Fat] on SMC mitochondrial activity and growth are 
conserved across species. 

To test SMC Fat1 function in vivo, we crossed Fat1'°* and Tagln-cre 
mice”* to generate Fat1'°*/"“P Tagin-cre (denoted hereafter as Fat 1SM®°, 
for Fat1 smooth muscle cell knockout) animals (Extended Data 
Figs 1b, 9a, b). Baseline arterial structure appeared normal, with mini- 
mal Fat1 expression (Fig. 4c, Extended Data Fig. 9a). After vascular 
injury”, Fatl appeared in the media and neointima of wild-type arteries, 
co-localizing with Acta2 and showing a perinuclear distribution 
consistent with findings in isolated cells (Figs 1d, 4b, Extended Data 
Fig. 9a, b). Injured arteries from Fat1SMK° mice showed markedly 
higher medial SMC hyperplasia and neointimal growth (Fig. 4c-f), 
increased phospho-histone H3 and cyclin D1 (Fig. 4g), and increased 
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staining for nitrotyrosine, a marker of increased superoxide stress 
(Extended Data Fig. 9c). Moreover, SMCs from injured Fat 1SM®° vessels 
exhibited higher OCRs than those from injured control vessels (Fig. 4h, i, 
Extended Data Fig. 9d). Thus, deletion of Fat 1 in SMCs in vivo permits 
an exaggerated vascular injury response characterized by SMC hyper- 
proliferation and enhanced mitochondrial respiration, which supports 
the idea that Fatl induction after vascular injury serves as a physiological 
regulator of SMC growth, in part by controlling mitochondrial function. 

Asa large, type-I transmembrane protein, full-length Fat1 appears 
an unlikely direct regulator of mitochondrial function. Previous studies 
describe cell-type-dependent cleavage of Fat1 that yields multiple 
smaller species, including Fat1-ICD-bearing derivatives of ~85 and 
~65kDa”*. Although the precise structure of the physiologic mitochon- 
drial Fat1 species that we have identified remains to be determined, 
in silico analysis of potential Fatl C-terminal peptides that start at 
amino acids 4195, 4203, or 4478 yield, on the basis of MitoProtII?” 
analysis, respective probabilities of 0.5881, 0.1140, and 0.1548 for 
export to mitochondria. Moreover, a mitochondrial function for 
Drosophila Fat, an atypical cadherin most closely related to mammalian 
Fat4 (ref. 28), has recently been reported; a 68 kDa Drosophila Fat cyto- 
plasmic fragment interacts with and stabilizes complex I”*. This last 
finding contrasts notably with the increased respiratory activity that we 
find with Fat] inactivation, and probably partly reflects the structural 
divergence between these cadherins”’. Nevertheless, the finding that 
both Drosophila Fat and mammalian Fat] participate in mitochondrial 
regulation is intriguing and perhaps suggests an ancient shared linkage 
between primordial cell surface proteins and control of mitochondrial 
function. Whether Fat4 is active in mitochondria, and how these distinct 
members of the Fat family might cooperate to regulate mitochondrial 
function, remain to be determined. 

Although the factors or conditions that trigger Fat] processing and 
translocation of C-terminal fragments to mitochondria have yet to be 
defined, our results suggest a relatively direct mechanism for sensing 
the extracellular milieu and relaying signals to mitochondria to control 
respiratory chain activity and thereby affect other key metabolic 
functions. This mechanism could potentially be targeted in hyperpro- 
liferative conditions in the cardiovascular system, as shown here, or in 
other scenarios. To this point, the loss or mutation of Fat] (refs 8, 9) 
and the gain of Fatl expression!®*° have both been linked to different 
cancers. In addition, mutations in the FAT1 gene have recently been 
implicated in abnormal renal development’! and skeletal muscle forma- 
tion’, and FAT1 sequence variants have been associated with bipolar 
disorder, schizophrenia, and autism spectrum disorders'’. The potential 
contribution of mitochondrial dysregulation in these divergent 
scenarios is an area for further study. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Protein interaction studies. Tandem affinity purification and mass spectrometry 
(TAP-MS) was performed using an IL-2R-Fatl-ICD fusion protein’ (Fat1-IL-2R) 
cloned into the pCTAP vector (240104, Agilent) to place tandem streptavidin 
and calmodulin binding peptides (SBP, CBP) at the C terminus of the chimaeric 
protein. A negative control construct included the IL-2R sequence, but not the Fat] 
ICD. These constructs were transfected into 293T cells (ATCC), from which total 
protein lysates were sequentially purified with SBP and CBP beads (InterPlay TAP 
kit, Agilent). After gentle washing, purified protein complexes were eluted from 
CBP beads and concentrated (UFC500396, Amicon Ultra), resolved by SDS-PAGE, 
stained with Coomassie, and submitted for MS/MS analysis (Protein Core Facility, 
Columbia University) to identify Fatl ICD associated proteins. MS/MS-validated 
proteins were scored on the basis of their abundance in the Fat1-IL-2R sample in 
comparison to the IL-2R control using the ProteoIQ software. 

Bioinformatic analysis of the TAP-MS results addressed the 175 potential Fat] 
ICD protein interactors with a MASCOT ratio (Fatl ICD over control) greater 
than 2. To identify novel interactors, we used information on known protein- 
protein interactions available in the STRING database*!. We downloaded 
all proteins that interacted with the 175 candidate interactors at the lowest 
confidence level in the database (STRING scores greater than 0.15), and calcu- 
lated the Page Rank for each node (protein) in the STRING network at different 
confidence levels (low, 0.15; medium, 0.40; high, 0.70; highest, 0.90) using the 
Perl module Graph::Centrality::Pagerank, available in the CPAN repository 
(http://www.cpan.org/). Nodes with high PageRank fractions refer to highly 
connected proteins more likely to be pulled down by chance. From the intersection 
of proteins with ‘high and ‘highest’ STRING confidence scores, we isolated 108 
novel Fat1 ICD interactors (Page Rank cutoff of <0.005) and performed enrich- 
ment analysis with the DAVID Bioinformatics Resource”. Results were plotted 
using the EnrichmentMap plugin® in Cytoscape 3.1 (ref. 34). 

Interaction of the Fat] ICD and endogenous NDUFS3 was assessed using the 

IL-2R and Fat1-IL-2R plasmids in TAP-based co-immunoprecipitation and immu- 
noblotting. To evaluate Fat1-prohibitin (PHB) interaction, PHB cDNA amplified 
by RT-PCR from mouse spleen RNA was cloned into pCS2-6XMyc vector to 
generate a C-terminally tagged PHB—Myc. DelN Fat1-ICD CMV 13-Flag® and/ 
or PHB pCS2-6XMyc plasmids were transfected into 293T cells. Protein lysates 
(300 1g) were pre-cleared with Protein G PLUS agarose beads (sc-2002, Santa Cruz) 
and mouse IgG] isotype control (M075-3M2, MBL). PHB-Myc was captured with 
c-Myc beads (sc-40 AC, Santa Cruz), and co-immunoprecipitated proteins were 
eluted by boiling and assessed for c-Myc and Flag using immunoblotting. 
Cell culture and imaging. Wild-type and Fat 1X° mouse aortic SMCs were isolated 
from aortas of 5-week-old control and Fat1SM®° mice!!! by collagenase-elastase 
digestion*®. SMCs were maintained in Dulbecco’s modified eagle medium 
(DMEM) containing 20% fetal bovine serum (FBS), 1% L-glutamine, 100 U ml“! 
penicillin, and 100,.g ml! streptomycin, and were passaged every three to four 
days. Cells between passages two and five were used for experiments; these early 
passage cells were not tested for mycoplasma. To evaluate population growth 
using the AlamarBlue assay (BUF012A, BIO-RAD), 5,000 cells were plated per 
well of a 96-well plate and followed for up to 96h. Absorbances at 570 nm and 
600 nm were measured on a Synergy 2 microplate reader (BioTek). Reduction of 
AlamarBlue correlates with the number of viable cells, and was calculated accord- 
ing to the manufacturer's specifications and expressed as fold-change with respect 
to baseline. Where indicated, treatment with 0.02-20 1M rotenone or DMSO 
(vehicle) was administered at 0 and 24h. Human aortic smooth muscle cells 
(HASMCs, Clonetics) were maintained in Medium 199 supplemented as indicated 
for DMEM, and passaged every four to five days. 

To assess proliferation, mouse SMCs were incubated with EdU for six 
hours before being fixed, permeabilized, and stained for EdU and total DNA 
(Hoechst stain), as advised by the manufacturer (C10337, ThermoFisher 
Scientific). Fluorescent signals (excitation:emission (Ex:Em) = 495:519 nm 
for EdU, 350:461 nm for Hoechst) were measured on a Synergy 2 microplate 
reader (BioTek), and EdU incorporation was expressed as the ratio of EdU 
to Hoechst signals. In some studies, wild-type and Fat1X° cells were trans- 
fected with 25 nM of control or Ndufs3 small interfering RNA (siRNA) (sense 
sequence 5‘-GCAGAACCGUUUUGAGAUUTT-3’; s86598, Ambion) on 
60mm culture dishes using TransIT-X2 (MIR6000, Mirus). The following day, 
cells were transferred onto 96-well plates at 2,000 cells per well, and grown for 
an additional three days before EdU incorporation was measured. HASMCs 
were transfected with 12nM of control or FAT1 siRNA (sense sequence 
5’/-CAGCGAAUGUAACAGUACAtt-3’; 5035, Ambion) on 60 mm culture dishes 
using TransIT-X2 (MIR6000, Mirus). Two days after transfection, cells were trans- 
ferred onto 96-well plates at 3,000 cells per well and grown for an additional two 
days before EdU assay. 
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Co-localization studies were performed as follows: SMCs were seeded onto glass 
coverslides, stained with 60 nM Mitotracker Red (M-7512, Life Technologies) for 
40 min at 37 °C, fixed with 4% paraformaldehyde (PFA) in phosphate buffer saline 
(PBS) for 15 min at 37°C, blocked overnight (0.2% milk, 2% normal goat serum, 
0.1M glycine, 2% BSA, and 0.15% Triton-X-100 in PBS), and stained for Fat1 using 
specific antisera!® and Alexa Fluor 488 goat anti-rabbit IgG secondary antisera in 
0.2% BSA in PBS. SMCs lacking Fat1 and omission of the primary antibody served 
as controls for signal specificity. Fluorescent signals were visualized using a 4-D 
spinning-disk confocal microscope (Perkin-Elmer) with a 40 x (1.3 NA) objective 
attached to a digital camera (Orca ER; Hamamatsu). Images were analysed with 
the Image-J (v1.48v) software. 

Electron micrography of wild-type and Fat1X° SMCs was performed in the 

Albert Einstein College of Medicine Analytical Imaging Facility using standard 
fixation and imaging procedures. Micrographs were analysed using the Image-J 
(v1.48v) software. 
Mitochondrial studies. Cellular fractions highly enriched for mitochondria were 
isolated using the Qproteome Mitochondria Isolation Kit (37612, Qiagen). Cells 
were lysed and further disrupted by passing the lysate through a 26-gauge, blunt- 
end needle, and mitochondria were isolated using differential centrifugation, 
followed by density gradient separation. Compartmental markers such as BiP and 
GAPDH (Fig. 1c, Extended Data Fig. 5c) indicated efficient though not absolute 
fractionation of mitochondrial components. 

Mitochondrial oxygen consumption rate (OCR) was measured using a XF96 
Extracellular Flux Analyzer (Seahorse Biosciences) following manufacturer's 
instructions. Mouse SMCs were plated on a Seahorse 96-well assay plate at 5,000 
cells per well, and tested in minimal assay media containing 5 mM glucose, 4mM 
pyruvate, and 2mM t-glutamine. The following drugs were injected to achieve 
the indicated final concentration: oligomycin (241g ml~1), FCCP (31M), and 
rotenone (241M). Total protein was isolated from each well and quantified for 
normalization after the assay. When indicated, Fat1X° SMCs were electroporated 
(Neon System, Life Technologies) with plasmids encoding Fat1-IL-2R, Fatl™©°, or 
empty pcDNA3.1 vector, immediately seeded at 6,000 cells per well onto Seahorse 
plates, and allowed to recover for three days before measuring OCR. To evaluate 
respiration in HASMCs, cells were transfected with 12 nM of control or FAT1 
siRNA (s5035, Ambion) on six-well culture plates using TransIT-X2 (MIR6000, 
Mirus). Three days after transfection, HASMCs were re-plated onto Seahorse plates 
at 3,000 cells per well, and OCR was measured as described. To evaluate respiration 
in SMCs after vascular injury, ligated arteries and their uninjured counterparts 
were collected from control and Fat1SM*° mice three days after arterial ligation 
using the collagenase-elastase method**, seeded on a Seahorse 96-well plate at 
15,000 cells per well, and allowed to recover for ~80h before OCR was measured. 
OCRs from validated mouse aortic SMC cultures were used to normalize and 
account for technical variability between experiments. 

To target the Fatl ICD to mitochondria, we assembled the Fat1™"° construct, 
which includes the first 58 amino acids (with the mitochondrial targeting 
sequence) of the mouse thioredoxin-2 protein (GenBank accession NM_019913) 
and amino acids 4,215-4,591 of mouse Fat1, and cloned it into the pcDNA3.1 
expression vector. Mitochondria-specific localization of this fusion protein was 
confirmed by cellular fractionation (Qiagen) of SMCs and 293T cells, followed by 
SDS-PAGE and immunoblotting with Fat! antisera’>. 

ATP was measured using a bioluminetric assay (30020-1 Biotium, Inc.) in 
wild-type and Fat1° SMCs. Micromolar concentrations of ATP were calculated 
using a standard curve and normalized to percentage of AlamarBlue reduction as 
a surrogate for cell number to estimate net ATP levels per cell. Data were expressed 
as fold change respect to wild type. 

Reactive oxygen species (ROS) levels were assessed by transfection of wild- 
type and Fatl KO Cells with a thiol redox-sensitive ratiometric sensor, reduction- 
oxidation-sensitive GFP (roGFP, Addgene plasmid 49435)°” using TransIT-X2 
(MIR6000, Mirus). After two days, cells were imaged on CELLview glass bot- 
tom dishes (5662-7860Q, USA Scientific) containing buffer with divalent cations 
(125mM NaCl, 5mM KCl, 1mM KH>PO,, 5mM glucose, 10 mM NaHCOs;, 1mM 
MgCh, 1mM CaCh, and 20 mM HEPES) at 37 °C**. Live cell images were acquired 
using IX81ZDC wide-field microscope (Olympus) and a single CoolsnapHQ2 
(Photometrics) cooled CCD-camera on the bottom 100% throughput port of the 
microscope. For ratiometric determination of cellular ROS content, cells were 
sequentially imaged with a 40x magnification objective lens (Olympus; UIS2 40X 
DIC N/A 1.3) using excitation light from a 100 W Hg arc lamp and bandpass filter 
FF390/40 (Semrock) (targeting the 400 nm excitation peak of the oxidized form of 
roGFP) and bandpass filter ET 470/40 (Chroma Technology) (targeting the 484nm 
excitation peak of the reduced form of roGFP). Fluorescence emissions were 
detected through a bandpass filter ET525/50 (Chroma Technology). Neutral density 
filters of 10% to 25% transmission (ND 1.0-0.6) were routinely used to reduce the 
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intensity of the excitation light to prolong cell viability. The camera exposure times 
for 390 nm and 470 nm excitations and the respective emission detections at 525 nm 
were maintained at a 2:1 ratio to maximize the light collection within the dynamic 
range of the detector. Typically, this was on the order of 200-600 ms, depending 
on the expression levels of the roGFP probe in a cell. Cells were imaged at 10-s 
intervals for 5 min, and for an additional 15 min after stimulation with 50 1M 
H2O>. Images were processed as previously described”. In brief, images were 
corrected for camera noise and for uneven illumination within the field of view, 
followed by background subtraction and a histogram-threshold-based masking. 
At each time point, processed images for each excitation wavelength were divided 
to obtain the ratiometric (390:470 nm) value, which was then expressed as fold 
increase above the average baseline (before H2O> stimulation) ratio. 

Aspartate levels were measured using a colourimetric assay (ab102512 Abcam) 
in total cell lysates from wild-type and Fat1X° SMCs. Data was corrected by 
subtracting the background as suggested by the manufacturer. Nanomoles of 
aspartate per sample were calculated using a standard curve and normalized to 
mg of protein per sample as a surrogate for cell number, and data were expressed 
as fold change respect to wild type. 

Respiratory complex activities and Fat1-complex association were assessed as 
follows. Mitochondrial complex I activity was evaluated in wild-type and Fat1° 
SMCs using the complex I activity microplate assay kit (ab109721, Abcam) in 
accordance with the manufacturer's instructions. The rate of oxidation of NADH 
to NAD* was measured after immunocapturing complex I from 200 1g of total 
protein lysate. Enzymatic (diaphorase) activity was calculated as the change in 
absorbance (1,000 x (optical density at 450 nm)) per minute (abbreviated as mOD/ 
min). Complex II activity was evaluated using the corresponding assay (ab109908, 
Abcam), measuring ubiquinol production after immunocapturing complex II from 
100 1g of total protein lysate, with enzymatic activity measured as a decrease in 
absorbance at 600 nm per minute (mOD/min). Complex IV activity was tested 
using the corresponding assay (ab109911, Abcam), with the rate of cytochrome 
C oxidation measured after immunocapturing complex IV from 701g of total 
protein lysate, and enzymatic activity measured as a decrease in absorbance at 
550 nm per minute (mOD per min). Complex V activity was measured in human 
aortic SMCs treated with control or FAT] siRNA using the ATP synthase enzyme 
activity microplate assay kit (ab109714, Abcam). Hydrolysis of ATP to ADP and 
phosphate was measured after immunocapturing complex V from 90 1g of total 
protein lysate. Enzymatic activity was measured as a decrease in absorbance at 
340 nm per minute (mOD per min). Negative controls were run in every assay, 
a Synergy 2 microplate reader (BioTek) was used to measure absorbance, and 
data were expressed as fold-change respect to wild-type cells. After the enzymatic 
assays, plates were washed with PBS, and proteins were eluted from immunocap- 
tured respiratory complexes by adding 10 11 per well of RIPA buffer with protease 
inhibitors plus 6x Laemmli protein loading buffer, and incubating the plate at 
85°C for 10 min. Recovered proteins were studied by immunoblotting for Fat1 
fragments and respiratory complex subunits. 

Respiratory complex conformations were evaluated by blue native (BN)/SDS- 
PAGE. First dimension BN-PAGE was performed by solubilizing wild-type and 
Fat1*° mitochondria isolates with 1% Digitonin and complex resolution using 
NativePAGE 3-12% Bis—Tris gel electrophoresis (Life Technologies). Second 
dimension analysis involved reducing and alkylating separated proteins, loading 
the gel strip onto a 10% Tris-Glycine gel, and performing SDS-PAGE, following by 
immunoblotting for specific complex components and Fat1. Native Complex I and 
supercomplex levels were analysed with the Image-J (v1.48v) software. 

Animal models. Tagln-cre and Fat1!°*?/'°*P mice have been described 
previously!!*4, We crossed Tagin-cre (129SV background) and Fat 1'*/*? (129SV/ 
C57BL6 mixed background) mice to generate Fat1 loxP/loxP (control) and Fat 1!*P/9x? 
Tagln-cre (Fat1SM®°) mice, using a breeding strategy to produce both genotypes 
within the same litter. Eight to ten week old male mice were studied. All ani- 
mals were housed in pathogen-free conditions. Procedures followed rules and 
regulations of the AAALAC, and were approved by the Institutional Animal Care 
and Use Committee (IACUC) of Albert Einstein College of Medicine. Vascular 
injury in mice was performed using the carotid artery ligation model, as described 
previously’. Mice were anaesthetized with ketamine/xylazine. The left common 
carotid artery was ligated with 6-0 suture just proximal to the carotid bifurcation, 
and the unmanipulated right carotid artery served as a control. Carotid arteries 
were collected at different time points up to 14 days after ligation. For collection, 
mice were euthanized with ketamine/xylazine and exsanguination, and the central 
vasculature was flushed with PBS and perfused with 4% PFA in PBS for 7 min. 
Arteries were removed, post-fixed with 4% PFA in PBS overnight, processed 
before being embedded in paraffin, and cross-sectioned (51m thickness) for 
subsequent studies. Morphometric analysis of arteries was performed using the 
ImageJ software as follows: the area of the lumen, the area inside the internal 
elastic lamina, and the area inside the external elastic lamina were measured in 


pixels, and the areas of the intima and media were calculated. The size of the 
neointima was expressed as the intima:media ratio. Areas of medial hyperplasia 
were measured in pixels and expressed as percentage of the total medial area. 
For immunohistochemical evaluation, arterial sections were deparaffinized and 
rehydrated. Endogenous peroxidase activity was neutralized before tissues were 
boiled in antigen retrieval solution (H-3300, Vector Labs). Sections were then 
blocked (2% BSA, 10% normal goat serum, and avidin blocking solution (Vector 
Labs) in PBS), incubated with primary antibody (in 2% BSA, 10% normal goat 
serum, and biotin blocking solution (Vector Labs) in PBS), incubated with biotiny- 
lated secondary antibody (Vector Labs) at a concentration of 1:500, incubated with 
ABC reagent (PK-6100, Vector Labs), incubated with DAB/substrate/chromagen 
system (K3467, Dako), and finally, counterstained with haematoxylin (H-3404, 
Vector Labs). Omission of primary antibodies served as controls for signal 
specificity, and the samples were imaged using a COOLSCOPE digital microscope 
(Nikon). For immunofluorescence studies, arterial sections were deparaffinized, 
rehydrated, and boiled in antigen retrieval solution (H-3300, Vector Labs). Sections 
were then incubated in blocking solution (0.3% Triton-X-100, 2% BSA, and 10% 
normal goat serum in PBS) overnight. Primary antibodies were applied in blocking 
solution, and fluorochrome-conjugated secondary antibodies were applied in 0.3% 
Triton-X-100, 2% BSA in PBS. Omission of primary antibodies served as controls 
for signal specificity. The sections were mounted with Fluoro-Gel II (with DAPI) 
mounting medium (17985-50, Electron Microscope Sciences) and visualized with 
an Axio Observer.Z1 (Zeiss) fluorescence microscope. 

Human coronary artery studies. Staining of human coronary artery samples 
was performed using formalin-fixed paraffin-embedded human coronary artery 
atherosclerotic plaque sections obtained from the CVPath Institute Sudden 
Cardiac Death Registry. To evaluate restenotic samples, we chose artery segments 
adjacent to the site of a bare-metal stent implanted within the preceding 30 days. 
H&éE stain and immunohistochemistry for FAT1 and ACTA2 were performed. 
The immunohistochemical staining was developed by NovaRED kit (Vector 
Laboratories, Burlingame, CA). The images were captured by Axio Scan. Z1 (Zeiss, 
Germany) using a 20x objective, and figures were prepared on the HALO image 
analysis platform (Indica Labs, Corrales, NM). 

Antibodies. We used antibodies against Fat] (ref. 15) (1:10,000 for western blot 
(WB), 1:2,000 for immunofluorescence (IF), 1:2,000 for immunohistochemistry 
(IHC)), BiP (1:5,000 for WB, 610979, BD Biosciences), Flag (1:15,000 for WB, 
F3165, Sigma), cyclin D1 (1:1,000 for WB, 1:200 for IHC, 29788, Cell Signaling), 
total OXPHOS cocktail (1:1,000 for WB, ab110413, Abcam), LC3 I/II (1:1,000 
for WB, 2775S, Cell Signaling), complex I NDUFA9 (1:1,000 for WB, ab14713, 
Abcam), complex II 70 kDa Fp subunit (1:10,000 for WB, 459200, Invitrogen), 
complex III subunit core 1 (1:2,000 for WB, 459140, Invitrogen), complex IV 
MTCOI (1:2,000 for WB, ab14705, Abcam), complex V ATP synthase subunit 
@ (1:1,000 for WB, 459240, Life Technologies), ACTA2 (1:200 for IHC, M0851 
Clone 184, DAKO), phospho-histone H3 (1:150 for IHC, 9701, Cell Signaling), 
transgelin (1:250 for IF, ab14106, Abcam), calponin 1 (1:200 for IF, ab46794, 
Abcam), pecam1 (1:200 for IF, ab28364, Abcam), anti-mouse IgG HRP (1:10,000, 
115-036-072, Jackson ImmunoResearch Labs), Alexa Fluor 488 conjugated 
anti-rabbit (1:250, 111-545-047, Jackson ImmunoResearch), Alexa Fluor 546 
conjugated anti-mouse (1:250, A-11003, Molecular Probes). The following 
antibodies were from Santa Cruz: complex I NDUFS3 (1:600 for WB, sc374282), 
c-Myc (1:20,000 for WB, sc789), 3-catenin (1:800 for WB, sc7963), PGC1-a 
(1:150 for WB, sc13067), PGC1-8 (1:100 for WB, sc373771), Tfam (1:150 for WB, 
sc23588), Mfn1 (1:600 for WB, sc50330), Mfn2 (1:600 for WB, sc50331), Acta2 
(1:200 for IF, sc32351), nitrotyrosine (1:250 for IF, sc32757), GAPDH (1:8,000 for 
WB, sc25778), anti-rabbit IgG HRP (1:10,000, sc2030), and anti-goat IgG HRP 
(1:10,000, sc2056). 

PCR. RNA was isolated from wild-type and Fat1X° SMCs with Trizol (Life 
Technologies), and cDNA was generated using Superscript III (Life Technologies). 
Quantitative reverse-transcription PCR (qRT-PCR) was performed using 
the SYBR Green system (Roche) on a Lightcycler 480 instrument (Roche). The 
relative abundance of each transcript was calculated using the AA“ method, 
with the Rpl13a housekeeping gene as reference. We used the following PCR 
primers. 

Mouse genotyping. Fat1'°*? allele forward 5'-GTCTGTGTTTGGCCTG 
AAGACGTA-3’, Fat1!°*? allele reverse 5‘-TTGATGTTGGTGTTGGGAAG 
TGCT-3’; cre transgene forward IMR1084 (Jackson Labs), cre transgene reverse 
IMR1085 (Jackson Labs); Fat14 allele forward 5‘-GICTGTGTTT GGCCTGAA 
GACGTA-3’, Fat 14 allele reverse 5/-TAGACAAACCATCACAGGCTGCCT-3’. 
qRT-PCR. Ppargcla forward 5‘-GAAAGGGCC AAACAGAGAGA-3/, 
Ppargcla reverse 5'-GTAAATCACACGGCGCTCTT-3’; Tfam forward 
5'-CAAAGGATGATTCGGCTCAG-3’, Tfam reverse 5‘-AAGCTGAATATA 
TGCCTGCTTTTC-3’; Nrfl forward 5‘-TGGAGT CCAAGATGCTAATGG-3’, 
Nrfl reverse 5‘-GCGAGGCTGGTTACCACA-3’; Opal forward 5‘-ACCAGG 
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AGAAGTAGACTGTGTCAA-3’,Opal reverse 5‘-TCTTCAAATAAA 
CGCAGAGGTG-3’; Dnm II forward 5’-GCTAGTCCACGTTTCACCAGA-3’, 
Dnmi1I reverse 5‘-TCCATGTGGCAGGGTCAT-3’; Mfn1 forward 5'‘-GTGAG 
CTTCACCAGTGCAAA-3’, Mfn1 reverse 5’-CACAGTCGAGCA 
AAAGTAGTGG-3’; Rpl13a forward 5'‘-GCTTACCTGGGGCGTCTG-3’, Rpl13a 
reverse 5/-ACATTCTTTTCTGCCTGTTTCC-3’. 

Statistical analysis. In general, at least three independent experiments were 
performed for each dataset, with a minimum n number of three biological 
replicates. No statistical methods were used to predetermine sample size for 
in vitro experiments. Comparisons between two groups were analysed by two-tailed 
t-test, and comparison between three or more groups were assessed by analysis of 
variance (ANOVA) followed by post-hoc test where appropriate. Significance was 
accepted for P< 0.05. All statistical analyses were performed and P values were 
obtained using the GraphPad Prism software 6.0. In the animal studies, the sample 
size was estimated” for a power of 80% and type I error of 0.05 to detect a 1.5- or 
1-fold change in the size of neointima measured as the intima:media ratio 3 or 
14 days after ligation, respectively. No animals were excluded. The breeding 
strategy was set up to generate both genotypes of interest within the same litter 
and mice were allocated into two groups according to their genotypes and not by 
randomization. The investigator who performed carotid artery ligation remained 
blinded to group allocation during experiments and outcome assessments. 
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* TAP-MS: String Analysis 


ORs Mitochondrion-related 
enriched group of 
Fat1 interactors 


Group GO descriptions: 


G001 mitochondrial part 

G002 organelle inner membrane 
G003 mitochondrial membrane 
G004 organelle envelope 

G005 envelope 

G006 mitochondrial inner membrane 
G007 mitochondrial envelope 
G008 mitochondrion 

G012 nucleoid 

G013 mitochondrial nucleoid 
G018 mitochondrial matrix 
G019 mitochondrial lumen 
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Extended Data Figure 1 | Fat1 ICD associates with inner mitochondrial 
membrane proteins. a, Description of subgroups within the mitochondrial cluster 
identified by STRING, PageRank, and DAVID enrichment analysis of TAP-MS 
results. GO, gene ontology. b, Western blotting for Fatl expression in mouse aortic 
SMCs. c, Co-immunoprecipitation of Fatl ICD and prohibitin (PHB) in 293T 
cells. IP, immunoprecipitation; performed with IgG (control) or Myc antibody, as 
indicated. WB, western blot. For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 2 | Loss of Fat1 increases expression of pro- 
proliferative gene products and ATP turnover, but does not affect basal 
ROS levels in SMCs. a, AlamarBlue assay in mouse aortic SMCs plated 

at specified, matched cell densities, showing correlation of AlamarBlue 
reduction across a range of input cell numbers (n= 3). b, Expression of 
cyclin D1 and 8-catenin in mouse aortic SMCs with indicated treatments. 
EV, empty vector. c, Quantification of OCR in mouse aortic SMCs from 
Fig. 2b (n= 10). d, Coupling efficiency calculated as percentage of 
reduction in OCR after oligomycin treatment (n = 10). e, Net ATP levels 
in mouse aortic SMCs. RU, relative units (1 =5). f, ROS levels in mouse 


aortic SMCs, measured with a redox-sensitive ratiometric sensor, roGFP. 
Left, Ratiometric (390/470 nm) value, representing the oxidative state of 
the sensor and intracellular ROS levels, and expressed as fold increase 
above the average baseline (before HO? stimulation) ratio (n= 15). 
Right: Images of cells showing representative ROS levels detected by the 
ratiometric sensor at baseline (t= 0) and at different time points after 
HO, stimulation. Data analysed by two-way ANOVA (a, ¢, f); and two- 
tailed t-test (d, e). NS, not significant. All data shown as mean + s.e.m. For 
gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 3 | Fat1 suppresses vascular SMC growth by mouse aortic SMCs treated with control siRNA (sictl) or Ndufs3 siRNA 
inhibiting the electron transport chain. a, Population growth of mouse (siNdufs3). c, Proliferation of mouse aortic SMCs after siNdufs3 treatment, 
aortic SMCs in the presence of various concentrations of rotenone, a expressed as the ratio of EdU to Hoechst signal. RU, relative units. n =3 
complex I inhibitor. Addition of rotenone at concentrations from 0.1-2.M for Fat1S° siNdufs3, n=5 for other groups; significance assessed by one- 
did not compromise wild-type cell growth; by contrast, Fat1*° cell way ANOVA. NS, not significant. All data shown as mean + s.e.m. For gel 
growth was suppressed to wild-type levels (n = 3); significance assessed source data, see Supplementary Fig. 1. 


by two-way ANOVA. b, Western blotting for NDUFS3 expression in 
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Extended Data Figure 4 | Loss of Fat1 does not affect overall oxidative phosphorylation proteins from each mitochondrial respiratory 
mitochondrial structure, mass, or dynamics in vascular SMCs. complex in total cell lysates of mouse aortic SMCs. h, qRT-PCR analysis 

a, Electron microscopy imaging (original magnification, 5,000 x) of of mitochondrial biogenesis and fusion/fission markers, normalized to 
mitochondria in mouse aortic SMCs. b-f, Quantification of electron Rpl13a expression. n= 3. i, Protein expression of biogenesis markers. 
micrographs for the number of mitochondria per cytoplasmic area j, LC3 levels in 3T3-L1 cells and Atg5 knockout cells (positive and negative 
(b, n= 11); mitochondrial area per cytoplasmic area (c, n= 111 for wild controls for autophagy, respectively), as well as in wild-type and Fat1 KO 
type, n= 119 for Fat1S°); mitochondrial circularity (d, n= 111 for wild SMCs. Conversion of LC3 I to LC3 I is indicative of autophagic activity. 
type, n= 119 for Fatl KO). mitochondrial crista length (e, n = 34 for wild k, Expression of Mitofusins 1 and 2, regulators of mitochondrial fusion. 
type, n=45 for Fat1*°); and crista width (f, n = 34 for wild type,n=45for NS, not significant. All data shown as mean + s.e.m., significance assessed 
Fat1*°). AU, area units; RU, relative units. g, Expression of representative by two-tailed t-tests. For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 5 | Mitochondria-targeted Fat1 ICD is sufficient 
to repress oxygen consumption in vascular SMCs. a, Electroporation of 
Fat1-IL-2R in mouse aortic SMCs, followed by subcellular fraction and 
SDS-PAGE analysis of Fatl ICD fragments. Green bracket, endogenous 
Fatl ICD species; red bracket, Fatl ICD products from Fat1-IL-2R; blue 
asterisk, non-specific signal. b, Schematic of mitochondria-targeted 

Fatl ICD, Fat1™"°. Asterisk, stop codon. ¢, Top, Detection of Fat1™" in 
the mitochondrial fraction of Fat1™° -transfected 293T cells. Bottom, 
Exclusion of Fat1™"° from the nuclear fraction of Fat1™"° -transfected 
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293T cells. d, Electroporation of Fat1™° in mouse aortic SMCs, followed 
by subcellular fraction and SDS-PAGE analysis of Fat1 ICD fragments. 
Green bracket, endogenous Fat1 ICD species; blue asterisk, non-specific 
signal. e, f, Quantification of baseline (e) and maximal OCR (f) after 
introducing Fat1-IL-2R or Fat1™"° into Fat1*° cells from Fig. 3a. Data 
shown as mean +s.e.m., n= 15, significance assessed by one-way ANOVA. 
C, cytoplasmic; EV, empty vector; Mit, mitochondrial; Ms, microsomal; 
—Nuc, non-nuclear fraction; +-Nuc, nuclear fraction; WL, whole-cell 
lysate. For gel source data, see Supplementary Fig. 1. 
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Extended Data Figure 6 | Fat1 ICD associates with mitochondrial 
respiratory complexes in SMCs, and limits the incorporation of 
complex I into supercomplexes. a, Rate of cytochrome C oxidation 

by immunocaptured complex IV from mouse aortic SMCs lysates, 
expressed as fold-change from wild type (n= 18). b, Rate of hydrolysis 
of ATP to ADP and phosphate by immunocaptured complex V from 
human aortic SMCs treated with control siRNA (sictl) or FAT] siRNA 
(siFAT1), expressed as fold change from sictl. n= 4. c, SDS-PAGE analysis 
of proteins eluted from immunocaptured respiratory complexes after 
completion of enzymatic assays, followed by immunoblotting for Fat1 
and complex II, IV, and V subunits. Red arrowhead, specific Fat1 signal; 


blue asterisk, non-specific signal. d, Quantification of native complex I 
(CI) levels from BN-PAGE analyses, including the example presented 

in Fig. 3e. n=5. e, Two-dimensional BN/SDS-PAGE analysis of mouse 
aortic SMC mitochondrial lysates, immunoblotted for Fat1 and complexes 
I-V. Dashed lines indicate co-migration of Fatl ICD species with 
respiratory complexes. Bottom panels are merged images of the individual 
western blots for complexes I-V (presented in Extended Data Fig. 7). 

SC, supercomplex. NS, not significant. Data shown as mean + s.e.m. 

and analysed by two-tailed t-test (a, b, d). For gel source data, see 
Supplementary Fig. 1. 
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Extended Data Figure 7 | Two-dimensional BN/SDS-PAGE analysis of SMC mitochondrial lysates, immunoblotted for complexes I-V. Individual 
western blots for complexes I-V used to generate the merged images in Extended Data Fig. 6e, bottom panels. 
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normalized to sictl. n = 3, significance assessed by two-tailed t-test. 
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Extended Data Figure 9 | Fat1 expression is induced in SMCs 

after vascular injury. a, Immunohistochemistry for Fat1 in control 
and Fat1SM®° carotid arteries before and after arterial injury, three 

(d3) or fourteen days (d14) after carotid ligation. L, lumen. The 
internal elastic lamina has been highlighted with a black line. Scale 

bar, 251m. b, Expression of Fat] and Acta2 (also known as SMA) by 
immunofluorescence in control and Fat1S“®° carotid arteries, fourteen 
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days after injury. White squares indicate the regions shown in higher 
magnification. Yellow line marks the internal and external elastic laminae. 
Scale bar, 201m. c, Nitrotyrosine (NT) staining of control and Fat [SMKO 
carotid arteries, seven days after injury. White line marks the internal 
elastic lamina. Scale bar, 501m. d, Cells isolated from carotid arteries three 
days after ligation injury expressing SMC (Acta2, calponin 1, transgelin) 
or endothelial cell (Pecam1) markers by immunofluorescence. 
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Extended Data Table 1 | Fat1 interactors identified by TAP-MS that localize to the inner mitochondrial membrane 


Gene 


NDUFS3 


HSPAQ 


SLC25A33 


Protein 


NADH ubiquinone 
oxidoreductase core 
subunit S3 


Ubiquinol-cytochrome c 
reductase core protein II 


ATP synthase, H+ 
transporting, mitochondrial 
FO complex subunit B1 


ATP synthase, H+ 
transporting, mitochondrial 
F1 complex, O subunit 
ATP synthase, H+ 
transporting, mitochondrial 
F1 complex, a subunit 1, 
cardiac muscle 


AFG3 like matrix AAA 
peptidase subunit 2 


ATPase family, AAA 
domain containing 3A 


Translocase of inner 
mitochondrial membrane 
domain containing 1 


Translocase of inner 
mitochondrial membrane 


ATP binding cassette 
subfamily E member 1 


Heat shock protein family 
A (Hsp70) member 9 


Nipsnap homolog 1 (C. 
elegans) 


Solute carrier family 25 
member 1 


Solute carrier family 25 
member 3 


Solute carrier family 25 
member 10 


Solute carrier family 25 
member 11 


Solute carrier family 25 
member 13 
Solute carrier family 25 
member 22 


Solute carrier family 25 
member 33 


Function 


Core subunit of Cl that belongs to the minimal assembly 
required for catalysis. 


Associates with and stabilizes respiratory complexes, 
particularly Cl. Important for mitochondrial morphology and 
function, and growth control. 


AAA protease involved in IMM proteolysis. Forms a 
supercomplex with PHB at the IMM. PHB inhibits AFG3L2. 


Located at contact sites between OMM and IMM. Required 
for mitochondrial network organization, metabolism, and 
protein synthesis. Required for nucleoid stability and cell 


Formation of cardiolipin-enriched microdomains. Recruits 
PHB to these microdomains to promote PHB function: 
assembly and stability of respiratory complexes. Promotes 
supercomplex formation, oxphos, Cl and Cll activities, and 
ATP production. Enhances T-cell proliferation. 

Cl assembly: associates with core subunits of Cl and 
components of the Cl assembly machinery. Loss of function 
interrupts Cl assembly, supercomplex formation, and 
mitochondrial respiration. 


Component of the TIM23 complex; mediates the translocation 
of transit peptide-containing proteins across the IMM. 


Inhibitor of RNA suppression. Regulation of mRNA turnover. 
Interacts with various Cl subunits and Cl assembly factors. 
Associates with cristae organizing system complex (MICOS), 
and modulates mito morphology. In yeast: binds to IMM to 
regulate mitochondrial import; optimizes translocation acting 
as an m-AAA protease to degrade/refold misfolded proteins. 
Implicated in cell proliferation, aging, cell fate determination, 
differentiation, tumor development. 


Citrate/malate exchange. Important for the bioenergetics of 
hepatic cells; provides a carbon source for fatty acid and 
sterol biosynthesis, and NAD* for the glycolytic pathway. 
Transport of phosphate groups (along with H+) from the 
cytosol to the mitochondrial matrix. Regulation of the 


Transport of 2-oxoglutarate across the IMM in exchange for 
malate or other dicarboxylic acids; important for malate- 
aspartate shuttle, oxoglutarate/isocitrate shuttle, 
gluconeogenesis from lactate, and nitrogen metabolism. 


Induced by IGF1/mTOR pathway. Controls mtDNA replication 
and transcription. Promotes mitochondrial respiration and cell 
growth, and protects from ROS-mediated differentiation 


Published Interactors 


A number of Cl, Il, Ill, and V 
components 


PHB2*, AFG3L2*, TIMMS50*, 
ATAD3A*, UQCRC2*, 
STOML2%*, and many 


NUDFS3* (and other Cl 
subunits), ATP50* (and other 
CV subunits), ATP5A1*, PHB’*, 
AFG3L2*, ATAD3A*, HSPD1* 
ATP50%, ATP5A1*, SLC25A3%*, 
STOML2*, HSPD1*, AFG3L2* 
and many Cl, Cll, Cll, and CV 
subunits 

ATP5F1*, ATP5A1*, 
UQCRC2*, and other 
respiratory subunits 


ATP5F1*, ATP50*, UQCRC2*, 
AFG3L2*, STOML2*, and other 
respiratory subunits 


PHB*, ATP5F1*, ATP5A1*, 
UQCRC2*, SLC25A3*, 
TIMMDC 1* and many Cl 
subunits 


PHB*, UQCRC2*, SLC25A3*, 
HSPD1*, and other Cl and CV 
subunits. 


PHB*, ATP5A1*, ATP5F1*, 
SLC25A3*, and other Clll and 
CV subunits 


AFG3L2*, core subunits of Cl, 
Cl assembly factors, Clll and 
CIV subunits 


DNAJA1*, DNAJA2*, 
DNAJA3*, HSPD1*, Cl and Cll 
subunits 


ATP5F1*, ATAD3A*, AFG3L2*, 
STOML2*, and many Cl, Il, Ill, 
and V subunits 


HSPD1* 


LETTER 


during mitochondrial dysfunction. 
«Associated proteins that were also identified by the Fatl TAP-MS screen. 


Refs 41-66 are cited in this table. 
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Mechanism of super-assembly of respiratory 


complexes III and IV 


Sara Cogliati!*, Enrique Calvo, Marta Loureiro!, Adela M. Guaras!, Rocio Nieto-Arellano!, Carolina Garcia-Poyatos!”, 
lakes Ezkurdia'!, Nadia Mercader!?, Jestis Vazquez! & José Antonio Enriquez!? 


Respiratory chain complexes can super-assemble into quaternary 
structures called supercomplexes that optimize cellular metabolism!. 
The interaction between complexes III (CIII) and IV (CIV) is 
modulated by supercomplex assembly factor 1 (SCAF1, also known 
as COX7A2L)’. The discovery of SCAF1 represented strong genetic 
evidence that supercomplexes exist in vivo”*. SCAF1 is present as a 
long isoform (113 amino acids) or a short isoform (111 amino acids) 
in different mouse strains”*. Only the long isoform can induce the 
super-assembly of CIII and CIV”, but it is not clear whether SCAF1 
is required for the formation of the respirasome (a supercomplex of 
CI, CII, and CIV)!**, Here we show, by combining deep proteomics 
and immunodetection analysis, that SCAF1 is always required for the 
interaction between CIII and CIV and that the respirasome is absent 
from most tissues of animals containing the short isoform of SCAF1, 
with the exception of heart and skeletal muscle. We used directed 
mutagenesis to characterize SCAF1 regions that interact with CIII 
and CIV and discovered that this interaction requires the correct 
orientation of a histidine residue at position 73 that is altered in the 
short isoform of SCAF1, explaining its inability to interact with CIV. 
Furthermore, we find that the CIV subunit COX7A2 is replaced by 
SCAF1 in supercomplexes containing CIII and CIV and by COX7A1 
in CIV dimers, and that dimers seem to be more stable when they 
include COX6A2 rather than the COX6A1 isoform. 

We analysed the organization of the respiratory chain by blue native 
gel electrophoresis (BNGE) in different tissues (Fig. 1a, b). In CD1 
mice, CII and CIV are distributed among several well-characterized 
structures (IV, IV, III,, supercomplex III,+IV, supercomplex I+II], 
and supercomplex I+III,+IV). Other structures of uncertain compo- 
sition showing co-migration of I and II; I, III and IV, or only IV are 
also observed. SCAF1 is present in all superassemblies containing CHI 
and CIV (Fig. 1a). By contrast, C57BL/6] mice showed a depletion of 
SCAF1 with loss of supercomplex III,+IV and a concomitant reduction 
in supercomplex I+III,+IV, which only remains in relevant amounts 
in skeletal muscle and heart (Fig. 1b). Further analysis confirmed that 
supercomplex III,+IV was virtually absent in C57BL/6J mice while 
heart, but not liver, retained a significant amount of supercomplex 
I+III,+IV (Fig. 1c-e). In adult heart mitochondria from both strains, 
additional bands containing CIV but not CIII or SCAF1 were detected 
(Fig. 1f-h). These supernumerary CIV bands were also observed in 
neonatal C57BL/6J but not in CD1 mitochondria (Fig. 1f-h). The respi- 
rasome and supercomplex III,+IV were present in adult and post-natal 
day 3 (P3) CD1 hearts while both were absent in P3 C57BL/6J hearts 
(Fig. 1f-h). Finally, two-dimensional BNGE/SDS—PAGE immunob- 
lotting showed that supercomplex III,+IV was still undetectable in 
C57BL/6 heart (Fig. 1i-l) and that the proportion of respirasome was 
significantly lower than in CD1 heart. 

Next, BNGE lanes were sliced into 26 fractions and analysed by 
mass spectrometry using a data-independent scanning method (DiS)’. 


We obtained near-complete protein coverage of complex components 
(Fig. 2a-c). Again, in C57BL/6J mice (SCAF1!"') supercomplex 
III,+IV was virtually absent in C57BL/6) livers and heart while the 
respirasome was detected in heart mitochondria from both strains 
(Fig. 2a—c). CIV contains several tissue-specific subunit isoforms; 
COX7A1 and COX6A2 are heart/skeletal-muscle-specific isoforms 
whereas COX7A2 and COX6A1 are liver-type isoforms that are present 
in rodent hearts in significant proportions. SCAF1 is also named 
COX7A2-like owing to its high similarity to this CIV subunit. We 
reanalysed the DiS fragmentation data to generate specific quantitative 
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Figure 1 | Super-assembly between CIII and CIV requires functional 
SCAF1. a-h, Immunodetection of the indicated proteins after BNGE 

of digitonin-solubilized mitochondria from CD1 and C57BL/6J mice 
(BL/6). Mitochondria were extracted from the indicated tissues (Ly, liver; 
Kd, kidney; Br, brain; MS, skeletal muscle; Hr, heart; Ln, lung) (a-d), 
from liver (e) or from adult (A) and 3 days post-natal (P3) heart (f-h). 
i-l, Immunodetection after BNGE followed by a second dimension 
SDS-PAGE. OE, overexposed. Red arrows indicate supercomplex II],+IV 
and supercomplex I+II,+IV. Gels representative of 2 (a, b), 4 (c, d), 2 (e) 
or 1 (i-l) biological replicates; in f-h representative of 2 biological and 

3 technical replicates. 


1Centro Nacional de Investigaciones Cardiovasculares Carlos III, 28029 Madrid, Spain. Institute of Anatomy, University of Bern, CH-3000 Bern, Switzerland. Departamento de Bioquimica y 


Biologia Molecular y Celular, Universidad de Zaragoza, 50009 Zaragoza, Spain. 
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profiles of these isoforms and of representative protein components 
of the complexes (Fig. 2d-g and Extended Data Figs 1-4). Notably, 
COX7A2 was found almost exclusively in the monomeric CIV form 
in CD1 heart and liver (Fig. 2d, f, green), while SCAF1 was found 
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Figure 2 | Super-assembly of the different CIII and CIV structures 
depends on the subunit composition of CIV. DiS mass spectrometry 
analysis of BNGE gel slices of digitonin-solubilized mitochondria from 
heart and liver from CD1 and C57BL/6J mice. a—c, Quantitative profiles of 
CI (a), CII (b) and CIV (c), expressed in total peptide-spectrum matches 
(PSMs); the diagrams indicate the subunits detected of each complex. 

X boxes indicate proteins not yet described in mice; proteins encoded in 
nucleus or mitochondria are marked in black or red, respectively; complex 
IV tissue-specific isoforms are marked in blue; white boxes, not detected; 
R, respirasome. d-g, Quantitative protein profiles, expressed in E-scores 
(see Methods). 


in the supercomplex III,+IV and in the respirasome in both tissues 
(Fig. 2d, f, red). These results strongly suggest that COX7A2 is replaced 
by SCAF1 when CIV is assembled into supercomplexes containing CIII. 

Notably, this asymmetry in the distribution of the COX7A2 subunit 
was maintained in C57BL/6J mitochondria, with COX7A2 detected 
only in the free CIV in both tissues (Fig. 2e, g, green) and slightly in the 
heart respirasome (Fig. 2e, green). C57BL/6J:SCAF1!!! was detected in 
heart and barely in liver (Fig. 2e, g, red), with a pattern different from 
that of SCAF1'1°, co-migrating predominantly with III, and super- 
complex I+II2 (Fig. 2e red). These data suggest that, being unable 
to form stable supercomplex III,+IV, SCAF1"!! remained bound to 
CII. Recent results proposed that SCAF1 is a either a CIII-interacting 
protein® or a CIV subunit®, and our results show that there is no 
contradiction between these alternatives. 

COX7A1 was found only in the dimer form of CIV in CD1 or 
C57BL/6 hearts, while COX7A2 was found in the monomer (Fig. 2d, e, 
compare blue and green). Hence, COX7A2 is replaced by COX7A1 
to form CIV dimers. Liver mitochondria express only the COX7A2 
isoform and the CIV dimer was slightly detectable (Fig. le). This 
suggests that COX7A2 may allow the formation of CIV dimers but 
with lower efficiency than COX7A1. In support of this interpretation, 
isolated mice liver mitochondria in vitro imported and assembled 
[°°S]COX7A1 but not [7°S]COX7A2 into the CIV dimer in addition to 
the CIV monomer (Extended Data Fig. 5a—d). 

We detected COX6A2 in heart mitochondria of both strains 
while COX6A1 was the isoform detected in liver (Fig. 2d-g, purple 
and orange). C57BL/6] heart mitochondria expressed both COX6A 
isoforms (Fig. 2e, purple and orange). COX6A2 was more abundant in 
CIV dimers and COX6A1 in monomers (Fig. 2e). Notably, when only 
COX6A1 is expressed (liver mitochondria), the amount of CIV dimer 
(Fig. 2f, g, yellow trace) is much lower than in heart mitochondria, 
which also express COX6A2 (Fig. 2d, e). In summary, the CIV dimer 
seems to be favoured by the concomitant presence of COX7A1 and 
COX6A2 isoforms. 

Our results suggest that SCAF1!!!, the variant carried by C57BL/6] 
mice, can interact with CIII but not with CIV. COX7A1 and COX7A2 
are very similar in size and sequence (Fig. 3a), while SCAF1 is 
significantly larger, containing in its mature form a long extra domain, 
as confirmed by proteomics (Fig. 3a and Extended Data Fig. 1). We 
hypothesized that the region with sequence homology with COX7A2 
determines SCAF1 interaction with CIV, while the extra domain is 
responsible for the interaction with CIII (Fig. 3a), so that SCAF1 
acts as bridge between CIII and CIV. To evaluate this hypothesis, we 
synthesized in vitro a series of SCAF1 mutants (Fig. 3b) and studied 
their incorporation into the different structures in isolated liver 
mitochondria. [*°S]SCAF1'! was imported into mitochondria and 
associated with supercomplex III,+IV and supercomplex I+III,+IV, 
but also with supercomplexes I+IH] and IHI,, and marginally with 
free CIV during short incubation times (Fig. 3c) but not after longer 
incubation periods (Extended Data Fig. 5). The matrix-exposed region 
of COX7A harbours His44, a residue critical for its interaction with 
other subunits of CIV, which corresponds to His73 in SCAF1 (Fig. 3d). 
Accordingly, the His73Ala SCAF1'? mutant failed to interact with 
supercomplex III,+IV and supercomplex I+III,+IV, but maintained 
the ability to interact with supercomplex I+II], and Il, (Fig. 3c, left 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


COX7A1 
a COX7A2 


M-R-ALRVSQALVRSFSSSTRS~~-----------------------------1 HLENRVA 27 
MLRNLLALROIAORTISTTSRR-~-----------------------------1 HFENKVP 29 


SCAF1113_  MYYKFSSFTOKLAGAWASEAYTPOGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 
a) 
Mitochondrial Specific SCAF1 Extra domain 
targeting —_N terminus 
sequence 
COX7A1 EKQKLFQADNDLPVHLKGGGMDNVLYRLTMTLTLGGTAYCLYCLGWASFPHKK- 80 
COX7A2 EKQKLFQEDNGMPVHLKGGASDALL YRATMALTLGGTAYAIYLLAMAAFPKKON 83 
SCAFI1!3  ELOKFFOQKADGVPIHLKRGLPDQMLYRTTMALTLGGTIYCLIALYMASQPRNK- 113 
a 
b Matrix exposed Transmembrane domain —_Intermembrane 
SCAF1113 MYYKFSSFTOKLAGAWASEAYTPQGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 
SCAF1113:H73A  MYYKFSSFTOKLAGAWASEAYTPOGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 
SCAF1113;4uTS OQKLAGAWASEAYTPOGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 
SCAF1113;AED MYYKFSSFTOKLAGAWASEAYT| |NKVP 60 
Scagi MYYKFSSFTOKLAGAWASEAYTPOGLKPVSTEAPPI IFATPTKLTSSVTAYDY, 60 
CARE MYYKFSSFTOKLAGAWASEAYTPOQGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 
SCAF1141;1?t  MYYKFSSFTOKLAGAWASEAYTPOGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 
SCAFI!3 ELQKFFQKADGVPIHLKRGLPDQMLYRTTMALTLGGTIYCLIALYMASQPRNK 113 
SCAFI213+#73A — ET.QKFFQKADGVPIALKRGLPDOMLYRTTMALTLGGTIYCLIALYMASQPRNK 113 
SCAFI13:4MTS_ = ELOKFFQKADGVPIHLKRGLPDQMLYRTTMALTLGGTIYCLIALYMASQPRNK 113 
SCAPI113;A4ED ELQKFFQKADGVP IHLKRGLPDQMLYRTTMALTLGGTIYCLIALYMASQPRNK 113 
SCAFT213;AIV I ] 113 
SCAFI?? ELQKFFQKADGF--HLKRGLPDQMLYRTTMALTLGGTIYCLIALYMASQPRNK 111 
SCAFI11iPr ELOKFFQKADGFPIHLKRGLPDOMLYRTTMALTLGGTIYCLIALYMASQPRNK 111 
c SCAF1: 113 H73A 111 iP| SCAF1"'3: AMTS AIV AED 
FCCP: “- + - + — + - + FCCP: - + = + - + 
7 i ~ wr 
[IIL +1V - v - < 
2 
lll, t 
I+ r 
“G-e @- 
J ee call 
d ve aa ? 


SCAF1""' 69 pgp--unK74 
COX7A1 


SCAF1113 16 NGLPVHLK23 


P SCAF113 SCAF1"1 
y \ | His21 
y) y 
y 
&\  é - 


— a) 
Figure 3 | Functional characterization of SCAF1 domains. 
a, b, Alignment of the amino acid sequences of COX7A1, COX7A2 
and SCAF1!3 (a) and of the mutant constructions (b). The critical His 
residue is highlighted in light green. The surrounding residues, needed 
for the localization of this His are in purple. Gln residue at the end of the 
mitochondrial targeting sequence is in blue. The deleted strings of amino 
acids are boxed and the substitutions or insertions underlined. H73A, 
His to Ala substition in SCAFI!; AMTS, deletion of the mitochondrial 
targeting sequence; AED, deletion of the SCAF1 extra domain; AIV, 
deletion of the area similar to COX7A proteins; iPI, Pro and Ile insertion 
in SCAFI'''. ¢, Mitochondrial import assay. Autoradiography of BNGE 
of CD1 mice liver digitonin-solubilized mitochondria after import of 
the indicated radiolabelled proteins. Labelling was fully ablated when 
mitochondrial import was prevented by FCCP-mediated dissipation 
of the membrane potential. Red arrows indicate supercomplex III,+IV- 
and supercomplex I+III,+IV-containing SCAF1. Representative of 
4 biological replicates. d, Modelling of the structural consequences of 
the PI micro-deletion suffered by SCAF1 in C57BL/6J mice (SCAF!"’), 
assumed to disrupt the spatial orientation of His73, altering the 
conformation required for interaction with CIV. 


69 DGVPIHLK74 


SCAF1' = 
SCAF1113 
COX7A1 


panel). SCAF1!!! lacks two amino acids adjacent to His73; structural 
modelling suggested that this deletion strongly affects the relative 
spatial position of this residue (Fig. 3d). Accordingly, SCAF1""! repro- 
duced exactly the phenotype of the His73Ala SCAF1''? mutant (Fig. 3c, 
left panel). Reintroduction of two amino acids into the sequence of 
SCAF1!"! was sufficient to partially restore the interaction with super- 
complex HI,+IV and supercomplex I+III,+IV (Fig. 3c, middle panel). 

Elimination of the SCAF1!"° extra domain (SCAF1!!*4£°) abolished 
the interaction with the respirasome and supercomplex III,+IV 
and also with supercomplexes I+TIII, and II], but did not affect the 
interaction with free CIV (Fig. 3c). Finally, elimination of the region 
with sequence homology with COX7A2 (SCAF1!!*4!’) or the mito- 
chondrial targeting sequence (SCAF1!!*4™"S) abolished the import of 
SCAF1'3 (Fig. 3c, left panel). These results suggest that SCAF1 acts as 
a bridge between CIII and CIV. 

To rule out the possibility that other unknown genetic elements 
unrelated to SCAF1 could contribute to the observed differences 
between CD1 and C57BL/6, we backcrossed 129S2/SvPasCrlf (which 
has SCAF1!13) with C57BL/6J to generate purebred C57BL/6J mice 


a b d = 2 
F ‘ = = Bo FO 
Liver Heart Liver Heart g g Ge2£ ex 
r2@ 5 2 2 5 m2 oa mo6600 
coco © © co © co © 
a cel cal = ae a 
no mo mn a Oo mM Flll,+iV- re | — 
q lll, 7 
— am” - ~ IL +IV = ~ ee 
_—— - il, - 
mg ,_ — oa —_ 
- Ilyv- pod P ‘ 
“es = oem: LV, - V- SCAFI lad 
on i at a - 
_ bre 
lal, 
¢ CORE! and Cox! SCAF1 Il, +IV ’ = 
ll; - ca ga od 
© x 100 50 
83 6 es -cre TT 
a SCAF1 COX5B UQCRC1 \V- CORE1 
= te. a ae ae 
S385 SEE BEES f 
& = - ee ee ae | 
ie lll +1V— 
= > f=MyHV FAIL 
=a | le tv 
\ i eM Ill, 
BL613 BL6"™"1 BLe"3 BL6™"1 BL6™3 BL6™1 
E-score E-score E-score Iv 
LTSSVTAYDYSGK EDPNLVPSISNK EVESIGAHLNAYSTR mM Merged 
9g cl 


ClV2 


COX6A2; COX7A1 


Cll, 


Ly COXTA2 _ pp COXTAZ 


— scAFi<— 


— SCAF1 <—~ 


AUIGeAS 


COX7A2 
Clll, 
Cllly + cw 


COX6A1; COX7A2 


ae Peps > 


ClV2 


Figure 4 | Recovery of SCAF1 function and models of CIV super- 
assembly. a-c, Heart and liver digitonin-solubilized mitochondria from 
C57BL/6] mice harbouring either SCAF'!? or SCAF1'! were subjected 

to BNGE and then to immunodetection with the indicated antibodies 

(a, b; representative of 4 biological replicates) or to quantitative DiS mass 
spectrometry analysis of representative proteins (c) in the form of contour 
plots indicating percentage of maximum E-score of the indicated peptides, 
normalized according to the level of ATP-synthase within each sample 
(d-f; 1 biological replicates). The same immunodetection analysis was 
performed with mitochondria from the indicated cell lines. g, Proposed 
model of CIV dimerization and super-assembly driven by the exchange 

of CIV subunit isoforms. 


harbouring SCAF 119 (B6J.129S2-Cox7a2l). SCAF1!" was sufficient to 
recover supercomplex III,+IV formation in both liver and heart, res- 
cued the respirasome in liver, and increased the respirasome content in 
heart (Fig. 4a). In these mice, SCAF1!!3 was immunodetected in the res- 
pirasome and in supercomplex III,+IV in both liver and heart (Fig. 4b). 
Proteomics analysis confirmed the assembly of supercomplex 
III,+IV containing SCAF1 in both liver and heart of SCAF1!" 
C57BL/6J mice (Fig. 4c). Finally, while both supercomplex III,+IV 
and the respirasome were detected in C57BL/6-derived fibroblasts 
containing SCAF1!", these structures were completely absent in 
C57BL/6-derived fibroblasts containing SCAF 1!!! (Fig. 4d-f, left lanes). 
When fibroblasts lacking either CIII or CIV were compared with their 
respective isogenic control cell lines, we observed that SCAF1 was 
found to co-migrate with free CIV in the absence of CIII or with CII] 
in the absence of CIV (Fig. 4d-f). These observations further support 
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the notion that the co-migration of SCAF1 with structures containing 
III,+IV is due to a true interaction with these supercomplexes. 

Here we unveil a function for different isoforms of CIV subunits, whose 
interchange drives the assembly of CIV into different supercomplex 
structures. In our experimental conditions, COX7A2 and COX6A1 would 
favour the free CIV form or decrease the stability of the CIV dimer, while 
COX7A1 and COX6A2 would stabilize the CIV dimer (Fig. 4g). Notably, 
COX6A2 is located at the interface of the two monomers’, where it could 
directly participate in stabilizing the CIV dimer, while COX7A1 is located 
away from the interface, so that it would have an indirect conformational 
role in dimer stabilization (Extended Data Fig. 6a). Similarly, the replace- 
ment of COX7A2 by SCAF1 would introduce a bridging motif that allows 
the binding of CIV to the CIII dimer (Fig. 4g). In fact, computational 
modelling suggested that this tentative structure is feasible (Extended 
Data Fig. 6b). Two reports presenting refined cryo-electron microscopy 
structures of the respirasome have recently been published'™". One of 
them speculated that COX7A1 can be substituted for SCAF1 (ref. 10), in 
full agreement with our experimental conclusions, although the level of 
resolution achieved did not enable the determination of which of these 
two proteins was present in the structure. Our results also support the 
existence of two types of respirasome in our experimental settings, one 
in which III, and CIV are bound through SCAF1 and interact together 
with CI (SCAF1-mediated respirasome or R®: I+III,+IV), and another 
in which III, and CIV interact with CI but not between them (R®°: 
IV+I+IIl)) (Fig. 4g). Notably, two different structures of the respirasome 
were reported by Letts et al.'° that precisely differed in the presence or 
absence of an interaction between CIII and CIV. It is tempting to speculate 
that these two structures may be differentiated by the presence of SCAF1 
or COX7A2, determining whether the interaction between CHI and CIV 
takes place or not. It is likely that R° would be more stable and abundant 
than R"S. Our model also reconciles the discrepancies due to SCAF1 
microdeletion in C57BL/6) mouse strains with regards to the impact of 
harbouring SCAF1'"! or SCAF1'3 in the supercomplexes’ organization 
of the electron transport chain“, In addition, our model explains why the 
predominant CIV-containing supercomplexes have a single copy of CIV, 
whereas the crystal structure of CIV was obtained froma purified dimer”. 

In summary, the structural arrangement of the components of the 
electron transport chain is substantially richer than anticipated and the 
functional implications of the diversity of complex IV ‘flavours’ remain 
as an exciting challenge that awaits further investigation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Molecular biology. The cDNA of SCAF1 113, 111 and the indicated mutants were 
synthesized and cloned in the pTNT vector by GenScript. 

Mitochondrial isolation. Mitochondria were isolated from mouse tissues as 
indicated elsewhere’. 

Mitochondrial import assay. SCAF1!!3, SCAF11"', and the different mutant 
proteins were cloned in the pTNT vector and in vitro translated using the 
TNT T7/SP6 Coupled Reticulocyte Lysate System (Promega) according to the 
manufacturer’s instructions using EXPRE*’S Protein Labelling Mix (Perkin Elmer). 
3.51 of the reaction mix (out of 50,11) were incubated with 100 ,g of isolated 
liver mitochondria from CD1 mice in 401] of reaction buffer (2mM succinate, 
1mM ATP, lmg ml! BSA, 1mM methionine, 40 Lg ml! chloramphenicol) for 
40 min (Fig. 3) at 37°C. Following this, they were washed two times with 1 ml of 
reaction buffer to remove the excess of non-imported proteins. Then, the washed 
mitochondrial pellet was incubated while rotating in 10011 of reaction buffer for 
1h at room temperature. To increase the incorporation rate and evaluate whether 
different COX subunits can be incorporated in different forms of supercomplexes, 
the mitochondria with the radiolabelled proteins were incubated in 500,11 of the 
same reaction buffer without pre-incubation and during 6h on rotation at room 
temperature (Extended Data Fig. 5a—d). In all cases mitochondria were further 
processed for the BNE and the gel electroblotted onto onto Hybond-P-polyvinylene 
fluoride (PVDF) membranes (GE Healthcare) and the radioactive signal detected 
following exposure of 2-4 days. 

Blue native gel electrophoresis (BNGE). Respiratory chain complexes and super- 
complexes were analysed by BNGE as described elsewhere’. 

Western blotting. After electrophoresis, gels were electroblotted onto Hybond- 
P-polyvinylene fluoride (PVDF) membranes (GE Healthcare) and immunob- 
lotted with specific antibodies against the different subunits of the complexes 
anti-COX7A2L polyclonal, St John’s laboratory (STJ110597); anti-COX7A2L 
monoclonal, Leitat Technological Center, Biomed Division; anti-COX7A1, 
Abcam monoclonal (AB123591); anti-COX5B policlonal, Proteintech (11418- 
2-AP); anti-COX1, Invitrogen monoclonal (35-8100); anti- UQCR10, Origene 
(NM_013387); anti-NDUFS3 monoclonal Abcam (AB14711); anti-UQCR2 
poloclonal Proteintech (14742-1-AC); anti- UQCRC1 Abcam (118687). The 
secondary antibodies were goat anti-rabbit IgG (H+L), Alexa Fluor 680 conjugate 
(A-21076) from Life Technologies and anti-mouse dylight 800 from Rockland 
(610145002) and acquired with the ODYSSEY Infrared Imaging System (LI-COR). 
Animal strains. Parental C57BL/6JOlaHsd and CD1 strains were purchased from 
Harlan Laboratories. Note that the C57BL/6JOlaHsd strain does not harbour the 
nicotinamide nucleotide transhydrogenase (NNT) spontaneous mutation that 
renders the encoded enzyme undetectable, and which is characteristic of the 
C57BL/6] strain provided directly by Jackson Laboratories. B6J.129S2-Cox7a2l 
(BL6!!°) strain was generated by crossing purebred female C57BL/6] mice 
contained the short allele of COX7A2L (SCAF1!!!) with purebred male 129S2/ 
SvPasCrlf contained the long allele of COX7A2L (SCAF 11°) both in homozygosis. 
The resulting heterozygous offspring were backcrossed until the F10 to obtain 
purebred C57BL/6] background mice containing the long form of COX7A2L in 
homozygous. Offspring after the tenth generation of backcrossing was considered 
backcrossed. All animal procedures conformed to EU Directive 86/609/EEC and 
Recommendation 2007/526/EC regarding the protection of animals used for 
experimental and other scientific purposes, enforced in Spanish law under Real 
Decreto 1201/2005. Four-month-old male mice were used as a source to purify 
mitochondria for molecular biology analysis, therefore sample size estimation, 
randomization and blinding was not necessary. 

Genotyping. Total DNA extracted from tails was genotyped by PCR with the 
following primers: COX7A2L forward CTTTCTTGCTTTGCAGAAGG; 
COX7A2L reverse GAAGGCCTCGTTTCAGGTGG. The product of PCR was 
resolved in a 15% acrylamide/bisacrylamide gel. The COX7A2L short allele 
generates a band of 50 bp while the long allele generates a band of 56 bp. 

Cell lines. All cell lines were grown in DMEM (D5796 Sigma-Aldrich) 
supplemented with 5% FBS (Gibco) and 1 mM sodium pyruvate (Lonza) in 
5% COz, 95% air atmosphere at 37°C. The different cell lines were generated as 
described previously". The CIII®® cell line and its isogenic control where called 
A22 and E9 respectively in the original publication!. The CIV®® cell line and 
its isogenic control where described elsewhere”'®. BL6!!! and BL6'!? MAF SV40 
transformed cell lines were generated respectively from wild-type C57BL/6JOlaHsd 
and B6J.129S2-Cox7a2] mice. All cell lines tested negative for mycoplasma. 
Mitochondrial proteomics by data-independent scanning mass spectrometry. 
BNGE gels were excised in 26 slices taking as reference some discrete Coomassie- 
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stained bands from the same sample: slice 6 corresponds to a band that mainly 
contains supercomplex I+IIIy, slice 10 to CV (ATP synthase), slice 12 to free 
supercomplex III, and slice 15 to free CIV. All slices were in-gel digested with 
trypsin. The resulting tryptic peptide mixtures were subjected to nano-liquid 
chromatography coupled to tandem mass spectrometry using a data-independent 
scanning (DiS) method as described previously’. In brief, peptides were injected 
onto a C-18 reversed phase (RP) nano-column (751m LD. and 50cm, Acclaim 
PepMap, Thermo Fisher, San José, CA, USA) and analysed in a continuous 
acetonitrile gradient consisting of 8-31% B for 130 min, 50-90% B for 1 min 
(B=0.5% formic acid in acetonitrile). Peptides were eluted from the RP nano- 
column at a flow rate of ~200 nl min“! to an emitter nanospray needle for real-time 
ionization and peptide fragmentation in either a Q-Exactive or a Q-Exactive HF 
mass spectrometer (Thermo Fisher). Each sample was analysed in two chromato- 
graphic runs covering different mass ranges (from 400 to 750 Da, and from 750 to 
1,100 Da, respectively). The DiS cycle consisted of 175 sequential HCD MS/MS 
fragmentation events with 2-Da windows that covered the whole 350 Da range. 
HCD fragmentation was performed at a resolution of 17,500 and a maximum 
injection time of 80 ms with the AGC set to a target of 3 x 10° ions. The whole cycle 
lasted 305 or less depending on ion intensity during chromatography. The narrow 
windows used for fragmentation allowed peptide identification using conven- 
tional shotgun searching algorithms. This was done using Sequest running under 
Proteome Discoverer 1.4 (Thermo Fisher Scientific), allowing two missed cleav- 
ages, and using 2 Da and 20p.p.m. precursor and fragment mass tolerances, 
respectively. Met oxidation and Cys carbamydomethylation were selected as 
dynamic or static modifications, respectively. 1% FDR was used as a criterion 
for peptide identification; this parameter was estimated using a separate inverted 
database and the refined method!®. 

Quantitative proteomics analysis and automated validation of MS/MS spectra. 
Two different approaches were used. For quantitative analysis of OxPhos complexes 
we used the total spectral counts’” of the proteins belonging to each complex. For 
specific protein quantitation we used the sum of intensities of all the observed 
fragment ions of charge state from +1 to +3 that were matched within 20 p.p.m. 
to theoretical fragmentation series of representative proteotypic peptides, which 
we have called the E-score. The sequence of peptides used to draw these traces, 
together with their MS/MS spectra, showing the specificity of the quantified 
fragments, are shown in Extended Data Figs 1-4. All E-score traces in Fig. 2 are 
drawn at the same relative intensity scale and are normalized according to the 
level of ATP synthase within each sample (Extended Data Fig. 5e, f). A custom 
application in R (Vseq) was developed to calculate the E-scores and to generate 
quality control plots to monitor the specificity of peptide quantitation (see 
Extended Data Figs 1-4). 

Structural modelling. Homology models for COX6A1 and COX7A2 were 
generated using Modeller'®, on the basis of protein templates from PDB 2Y69 
corresponding to chains G and J, respectively. The tentative structural model 
of SCAF1-mediated supercomplex III2+IV formation was generated based on 
homology with COX7A1 (PDB accession code 1V54) and UQCR11 (PDB accession 
code 1BGY) using Modeller. The two domains were oriented according to the 
fitted model for bovine mitochondrial supercomplex I+III,+IV by single particle 
cryo-EM on PDB 2YBB, using Pymol and focusing in the III,+IV adjustment 
(The PyMOL Molecular Graphics System, Version 1.8 Schrédinger, LLC). 

Data availability. All the data that support the findings of this study including 
mass spectrometry data are freely available from the corresponding authors upon 
request. 

Code availability. Vseq application (R code) is freely available upon request. 
Source data for the blots/gels in Figs 1, 3 and 4 and Extended Data Fig. 5b, and 
E-score data represented in Fig. 2 are provided in the Supplementary Information. 


12. Fernandez-Vizarra, E. et al. Isolation of mitochondria for biogenetical studies: 
An update. Mitochondrion 10, 253-262 (2010). 

13. Acin-Pérez, R., Fernandez-Silva, P.,, Peleato, M. L., Pérez-Martos, A. & Enriquez, 
J. A. Respiratory active mitochondrial supercomplexes. Mol. Cell 32, 529-539 
(2008). 

14. Acin-Pérez, R. et al. Respiratory complex III is required to maintain complex | 
in mammalian mitochondria. Mol. Ce/l 13, 805-815 (2004). 

15. Diaz, F., Fukui, H., Garcia, S. & Moraes, C. T. Cytochrome c oxidase is required 
for the assembly/stability of respiratory complex | in mouse fibroblasts. Mol. 
Cell. Biol. 26, 4872-4881 (2006). 

16. Navarro, P. & Vazquez, J. A refined method to calculate false discovery rates for 
peptide identification using decoy databases. J. Proteome Res. 8, 1792-1796 
(2009). 

17. Nahnsen, S., Bielow, C., Reinert, K. & Kohlbacher, O. Tools for label-free peptide 
quantification. Mol. Cell. Proteomics 12, 549-556 (2013). 

18. Webb, B. & Sali, A: Comparative protein structure modeling using MODELLER. 
Curr. Protoc. Bioinformatics 47, 5.6.1-32 (2014). 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


SCAF 1" 


Peptide a 


| Pee ENOR nee eee er aera 
MYYKFSSFTQKLAGAWASEAYTPQGLKPVSTEAPPIIFATPTKLTSSVTAYDYSGKNKVP 60 


a 
Peptide b 
en rial Specific 
argetin ; 
sequ ong. N-terminus 


ELOKFFOKADGVP TELKRGLPDOMLYRTTMALT 


Matrix exposed Transmem 


SCAF1 


Extra domain 


'LGGTIYCLIALYMASQPRNK- 113 


brane domain —Intermembrane 


: SCAF1 peptide a: LTSSVTAYDYSGK 
COX7A2|_MOUSE: Cox7a2I; Cox7a2R, SCAF1 
s0s| ; FirstScan= 18349 
| & Xcorr= 2.89715147 
Bre; 1 Charge= 2 
2 tero4 i H SQL-Sequence= LTSSVTAYDYSGK 
Bese, & RT= 59.930912 
2 | ~ 2. 8 DeltaM= 1.32387300000005 
© 2e'04- ¥ 
= | ; Escore= 3898201.69 
TAO ' m.Mass= 1392.992723 
5e'03/ 0. ° H 
\ H 
2e03 |. ' . 
0 20 40 
error in ppm. 
MZ= 697, Charge= CHARGE=2+ LTSSVTAYDYSGK+ 1.32387300000005 
0 
=40 bee: Ty) ys 
6e'05 - = |= - a e 
2 = -_ AQ E 
go - 8 
2 46°05 | & bide 30 ok 
g a 20 cr vies e 
E g - - 2 6 
me O uy) TE 
D10 Ll w 
S a = 
0e'00 - aims ty) 
200 400 600 800 1000 1200 HAOTHONDHOrAAMArTCAoNOHTHAK 
Cc m/z b series is y series 
Seni SCAF1peptide b: LAGAWASEAYTPQGLKPVSTEAPPIIFATPTK 
je'05+  “o ~ we FirstScan= 96073 
ere by ‘ Xcorr= 6.8665 
3 $. ' Charge= 3 
2 real SQL-Sequence= LAGAWASEAYTPQGLKPVSTEAPPIIFATPTK 
2 oe ' : RT= 196.88 
c. et ' ‘« DeltaM= 2.03463099999954 
2 sews} @° Escore= 720811.67 
7 e . - m.Mass= 3314.785971 
2e03t 92°” io 
a) | 6 
i) 
0 20 40 
error in ppm. 


MZ= 1104.92261, Charge= CHARGE=3+ 


LAGAWASEAYTPQGLKPVSTEAPPIIFATPTK 


50 
_ = 
Zz 501% £ 
5 irl 40 S 
i ” 
> $ i = 
=) Q ee a 30 «6 
= © if i=4 
® € 30 ” i D 
z a lle aot > 
a th hase wie a 
a Ee ve 10 
> 10 if iat | 2 
Ss . i 3 
T T T T sane 2 
500 14000 1500 2000 TM ONSET ALES RRNR SL SRHAKAS ELS EAN IH 
miz b series y series 


Extended Data Figure 1 | Specificity of tandem mass spectrometry (MS/ 
MS) fragmentation series used to generate quantitative profiles for 
SCAF1-specific peptides. a, Amino acid sequence of SCAF1!"; boxes 
indicate structural domains. The location of the proteotypic peptides 
identifying the extra domain (b) and the N-terminal region of SCAF1 

(c) are indicated. b, c, Quality control plots for proteotypic peptide 
fragmentation were generated using Vseq, a program written in R. Top, 
intensity versus fragment mass error plot; only the fragments with a mass 
error below 20 p.p.m. were assumed to be correct matches. Summation of 
the intensities located inside the blue dashed box represents the E-score. 
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Lower left, representative MS/MS spectrum of the peptide indicating 

the matched fragments. Lower right, diagram showing the colour 

coded mass error of the fragments ranked according their m/z values 

and their correspondence with theoretical fragmentation series. Grey 
background colour indicates errors higher than 50 p.p.m. The b and the 

y fragmentation series form a V-shaped path that joins fragments with 
the same charge. The completeness of the “V’ shape assesses the quality of 
the identification. Note that since HCD fragmentation was used, y-type 
fragment ions are more intense than b-type ions. 
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Extended Data Figure 2 | Specificity of MS/MS fragmentation series used to generate quantitative peptide profiles for COX7A2 and COX7A1. 
a, b, Profiles for COX7A2 (a) and COX7A1 (b). The quality control plots are as in Extended Data Fig. 1. 
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Extended Data Figure 3 | Specificity of MS/MS fragmentation series used to generate quantitative peptide profiles for COX6A1 and COX6A2. 
a, b, Profiles for COX6A1 (a) and COX6A2 (b). The quality control plots are as in Extended Data Fig. 1. 
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Extended Data Figure 4 | Specificity of MS/MS fragmentation series used to generate quantitative peptide profiles for COX5B and UQCRC1. 
a, b, Profiles for COX5B (a) and UQCRCI (b). The quality control plots are as in Extended Data Fig. 1. 
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Extended Data Figure 5 | Mitochondrial import assay of CIV proteins 
and quantitation of CIV by mass spectrometry. a—d, Autoradiography of 
BNGE of CD1 mice liver digitonin-solubilized mitochondria after import 
of the indicated radiolabelled proteins. Labelling was fully ablated when 
mitochondrial import was prevented by FCCP-mediated dissipation of 
the membrane potential. Images represent two biological replicates. OE, 
overexposed. Red arrows indicate dimer CIV. e, f, Quantitative contour 
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plots of the most abundant proteins of CV. Data-independent (DiS) 

mass spectrometry analysis of BNGE gel slices of digitonin-solubilized 
mitochondria from heart and liver from CD1 and C57BL/6J mice. e, Total 
spectral counts of the indicated components of ATP synthase, illustrating 
the reproducibility obtained upon slicing of BNGE gels. f, Total spectral 
counts after normalization according to ATP synthase levels. 
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Complex IV dimer: Bottom view 
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Extended Data Figure 6 | Structural models of supercomplex assembly. supercomplex III,+IV structure stabilized by a SCAF1 bridge. SCAF1 is 


a-c, Structural modelling of COX6A1 and COX7A2 isoforms on the proposed to substitute COX7A2 in CIV and to interact with IIL, via its 
CIV dimer. The structural model for COX6A1 (in blue) is based on its extra domain. The structural model for SCAF1 (red mesh) was generated 
homology with COX6A2 (PDB 2Y69 chain G, in red) and the structural based on its homology with COX7A1 (PDB 1V54, in blue) and with 
model for COX7A2 (in orange) is based on its homology with COX7A1 UQCRI11 (PDB 1BGyY, in green). CIV is shown in orange and the two 
(PDB 2Y69 chain J, in green). d, Tentative model showing a possible components of CIII dimer in yellow and grey. 
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Cascading MutS and MutL sliding clamps control 
DNA diffusion to activate mismatch repair 


Jiaquan Liu!*, Jeungphill Hanne, Brooke M. Britton!, Jared Bennett!, Daehyung Kim’, Jong-Bong Lee*? & Richard Fishel!* 


Mismatched nucleotides arise from polymerase misincorporation 
errors, recombination between heteroallelic parents and chemical 
or physical DNA damage’. Highly conserved MutS (MSH) and MutL 
(MLH/PMS) homologues initiate mismatch repair and, in higher 
eukaryotes, act as DNA damage sensors that can trigger apoptosis”. 
Defects in human mismatch repair genes cause Lynch syndrome 
or hereditary non-polyposis colorectal cancer and 10-40% of 
related sporadic tumours’. However, the collaborative mechanics 
of MSH and MLH/PMS proteins have not been resolved in any 
organism. We visualized Escherichia coli (Ec) ensemble mismatch 
repair and confirmed that EcMutS mismatch recognition results in 
the formation of stable ATP-bound sliding clamps that randomly 
diffuse along the DNA with intermittent backbone contact. The 
EcMutS sliding clamps act as a platform to recruit EcMutL onto the 
mismatched DNA, forming an EcMutS-EcMutL search complex that 
then closely follows the DNA backbone. ATP binding by EcMutL 
establishes a second long-lived DNA clamp that oscillates between 
the principal EcMutS-EcMutL search complex and unrestricted 
EcMutsS and EcMutL sliding clamps. The EcCMutH endonuclease 
that targets mismatch repair excision only binds clamped EcMutL, 
increasing its DNA association kinetics by more than 1,000-fold. The 
assembly of an ECMutS-EcMutL-EcMutH search complex illustrates 
how sequential stable sliding clamps can modulate one-dimensional 
diffusion mechanics along the DNA to direct mismatch repair. 

Mismatch repair (MMR) is an excision-resynthesis system that is 
initiated at a DNA strand break, which may be located 3/ or 5’ and 
hundreds to thousands of base pairs away from the mismatch’. The 
fidelity of replication-associated MMR depends on communicating 
mismatch recognition to the distant strand break and then exclusively 
directing excision of the strand containing the misincorporation 
error’. Resynthesis of the single-strand DNA (ssDNA) gap appears 
to be independent of excision and is performed by the replicative 
polymerase’. Although the source of the strand break that directs MMR 
in most organisms remains unclear!*°, a subset of “\-proteobacteria 
that includes E. coli have evolved DNA adenine methylation (Dam) 
and the MutH endonuclease to specifically introduce a scission into a 
transiently hemimethylated Dam GATC site on the newly replicated 
strand’. 

How MSH and MLH/PMS proteins communicate mismatch 
recognition to distant MMR components such as MutH continues to 
be uncertain’. A recent study with Thermus aquaticus (Ta) proteins 
suggested that TaMutL traps TaMuts at the mismatch, which then 
hypothetically acts as a catalyst to polymerize multiple TaMutL along 
the DNA’. Other studies have visualized Saccharomyces cerevisae (Sc) 
and human (Hs) MSH and MLH/PMS proteins diffusing along the 
DNA, although the role of these operations in MMR is unknown!"}3, 
These mechanisms contrast a traditional model where the assembly 
of a MutS-MutL complex executes directional ATP-hydrolysis-driven 
DNA translocation to activate distant MMR!*!°, 


We used single-molecule total internal reflection fluorescence 
microscopy to image E. coli MMR on fields of ~200 well-defined 
doubly-tethered 17.3-kb mismatched DNAs’® (Extended Data 
Fig. 1, Extended Data Table 1, Methods). Injection of Alexa647 
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Figure 1 | The formation of an ECMutS-EcMutL complex alters the 


diffusion properties of EcMuts. a, Representative kymographs and 
illustration showing an EcMutL loaded by an EcMutsS sliding clamp. Blue 
line indicates the mismatch position. b, The distribution of dwell times 
for the EcCMutS-EcMutL complex (Ton.zemuts-EcMutL3 Mean + s.e.m.). 

c, Box plots of D for the EcMutS-EcMutL complex at different NaCl 
concentrations (n = number of molecules; Methods). d, The distribution 
of the starting positions for EcMutS (top) or ECMutS-EcMutL complexes 
on DNA (bottom). There are two possible orientations of the mismatched 
DNA with mismatch position (blue star; middle panel). Diamonds in top 
and bottom panels represent individual starting events. Gaussian fit to the 
top panel distributions is shown as black lines with the mean + s.d. 
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(AF647)-labelled EcMutS with ATP!° (Extended Data Tables 1, 2, 
Extended Data Fig. 2a, Methods) resulted in numerous particles (93% 
of total) that diffused along the DNA with a diffusion coefficient 
(Decmuts = 0.043 + 0.016 j1m? s~') and an association lifetime 
(ToneEcMut$ = 185 + 35s) that was two- to three-fold less than that of 
TaMutS or H3MSH2-HsMSH6 (Extended Data Tables 3, 4, Extended 
Data Fig. 2c-e)'*'°!7, These observations support previous studies 
showing that mismatch binding provokes the formation of ATP-bound 
MSH sliding clamps that freely diffuse on DNA!*!!"3!8!9_ Tn contrast to 
other single-molecule imaging studies!!”°, we found that Cy3-labelled 
EcMutL!* (Extended Data Tables 1, 2, Extended Data Fig. 2a) only 
binds DNA at low ionic strength or in the absence of magnesium, which 
was not altered with ATP (Extended Data Fig. 3). 

Co-injection resulted in frequent co-localization of EcCMutS-AF647 
and EcMutL-Cy3 particles on the mismatched DNA (Fig. 1a, Extended 
Data Fig. 4a, Supplementary Video 1). Most particles (80%) displayed 
unambiguous real-time coordinated movement along the DNA 
(Extended Data Table 3, Extended Data Fig. 4b) that permitted deter- 
mination of the lifetime (Ton. EcMuts-EcMutL = 43 + 38) and diffusion 
coefficient (Decmuts-BeMutL = 0-004 + 0.002 1m? s~!; Fig. 1b, ¢, 
Extended Data Table 4). The diffusion coefficient was constant over 
a range of ionic strengths (Fig. 1c, Extended Data Table 4), indicating 
the EcMutS-EcMutL complex remains in continuous DNA contact 
during diffusion?!. By estimating the Stokes radius from published 
structures” we calculated an average free-energy barrier (¢) of 1.7 
kgT for rough-landscape diffusion?’ (see Methods). These results are 
consistent with the conclusion that the EcCMutS-EcMutL complex 
can engage in an efficient DNA search that includes rotation-coupled 
one-dimensional (1D) diffusion along the backbone!®??, 

Although EcMuts sliding clamps originated at the mismatch (Fig. 1d, 
top), the initial association with EcMutL appeared random along the 
DNA (Fig. 1d, bottom). Following the assembly of an ECMutS-EcMutL 
complex, four types of dissociation mechanics were observed (Fig. 2a). 


49% 
22% 
27% 


Ls soqo< 2% 


The least common type (2% of total events) was simultaneous dis- 
sociation of both EcMutS and EcMutL from the mismatched DNA 
(Fig. 2a). The most common type (49%) engaged in a dynamic 
dissociation-association (oscillating) complex on the DNA (Fig. 2a, b, 
white arrows, Extended Data Fig. 4c, Supplementary Videos 2, 3). 
The association time for the oscillating EcCMutS—EcMutL complex 
(ToneEcMut$-+EcMutL = 30 + 3 s; Extended Data Fig. 4d) was similar to that 
of the initial EcMutS-EcMutL complex assembly (Fig. 1b). However, 
the dissociation lifetime was ~8-fold shorter (ToffeEcMut$—+EcMutL 
= 3.8+0.3s; Fig. 2c), indicating that EcMutS and EcMutL were in 
complex ~90% of the time (Fig. 2d). The location of the oscillating 
EcMutS-EcMutL complexes occurred over the entire DNA length 
(Fig. 2b, Extended Data Fig. 4c). Interestingly, the EcMutL clamps 
appeared to bypass the EcMutS clamps in 12% of the oscillation events 
(n=67; Fig. 2b, lower white arrows). 

A third type of EcCMutS—EcMutL complex dissociation (22%) 
left EcMutS alone on the mismatch DNA that appeared to retain 
identical properties to the initially loaded EcMuts sliding clamps 
(Fig. 2a), whereas a fourth type (27%) left solitary extremely 
stable EcMutL particles on the DNA (Ton.gemut = 851 + 155 5; 
Fig. 2e, f, Extended Data Fig. 4e) that diffused ~20 times faster 
than EcMutS sliding clamps and ~200 times faster than the 
EcMutS—EcMutL complex (Decmutt = 0.888 + 0.393 um? s~|; Fig. 2g, 
Extended Data Table 4, Supplementary Video 4). Dgcmuti increased 
with ionic strength, indicating that EcMutL diffuses with intermittent 
DNA contact”! (Fig. 2g, Extended Data Table 4). Both EcMutS and 
ATP were required to observe EcMutS-EcMutL complexes and stable 
fast-diffusing EcMutL particles on the mismatched DNA (Fig. 2h, 
Methods). 

The ATP dependence, stability and intermittent DNA contact 
suggest that the EcMutL particles could engender a second sliding 
clamp. To test this hypothesis, we examined MMR-defective 
mutant protein EcMutL(R95F), which is incapable of binding ATP 
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Figure 2 | The formation of an oscillating EcCMutS-EcMutL complex 
and fast-diffusing EcMutL. a, Representative kymographs and 
illustrations showing the different types of ECMutS—EcMutL complex 
dissociations (coloured letters). The frequency (%) of each dissociation 
type is shown (right, coloured numbers). b, Representative kymographs 
and illustration showing an oscillating ECMutS-EcMutL complex 

(white arrowheads indicates dissociation events). c, Dissociation 

time distribution (Toffbemuts—+EeMutL3; Mean + s.e.m.) for the oscillating 
EcMutS-EcMutL complex. d, Illustration of oscillating EcMutS—EcMutL 
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complex with lifetimes and calculated diffusion distances. Oscillations are 
indicated for illustration only and should be stochastic. e, Representative 
kymographs and illustration showing the dissociation and fast diffusion 
of EcMutL from an EcMutS-EcMutL complex. f, Dwell time distribution 
(mean + s.e.m.) of fast-diffusing EcMutL on the mismatched DNA. g, Box 
plots of D for fast-diffusing EcMutL at different NaCl concentrations. 

h, The frequency of EC¢MutS-EcMutL complex and fast-diffusing ECMutL 
under various conditions (mean + s.d.); n= number of DNA molecules. 
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Figure 3 | ATP binding by EcMutL results in formation of a ring-like 
sliding clamp. a, Representative kymographs showing the formation 
of two separate EcMutS-EcMutL(R95F) complexes on a mismatched 
DNA. b, Dwell time distribution for the ECMutS—EcMutL(R95F) 
complex (Ton.EcMutS-EcMutL(R95F); Mean + s.e.m.). ¢, Box plots of D for the 
EcMutS sliding clamp (S); EcMutS-EcMutL complex (S-L); EcMutS- 
EcMutL(R95F) complex (S-L(R95F)); and EcMutL particle (L) at 
100mM NaCl. d, The frequency of EcMutS-EcMutL complex (blue) and 
fast-diffusing EcMutL (grey) with wild-type EcMutL or EcMutL(R95F) 
(mean + s.d.). e, Pie charts showing the distributions of dissociation types 


or activating EcMutH!*4 (Extended Data Tables 1, 2, Extended 
Data Fig. 2a, b). Numerous EcMutS-EcMutL(R95F) complexes 
were observed on the mismatched DNA that displayed a lifetime 
(TonsEcMutS-EcMutL(R95F) = 32 £2) and diffusion (DgcMuts—EeMutL (R95) 
= 0.003 + 0.002 1m? s~') similar to the wild-type EcMutS-EcMutL 
complex (85% of the particles; Fig. 3a-c, Extended Data Tables 3, 4). 
These results confirm that EcCMutL(R95F) associates normally with 
EcMuts sliding clamps'®. However, oscillating ECMutS—EcMutL(R95F) 
and stable fast-diffusing ECMutL(R95F) were never observed (Fig. 3d, 
Methods), with the majority of complex dissociations leaving EcMutS 
alone on the DNA (77%, Fig. 3e). We conclude that a principal role for 
ATP binding by EcMutL is to establish an oscillating ECMutS—-EcMutL 
complex and/or stable fast-diffusing EcMutL. 

Pre-binding EcMutS with a poorly hydrolysable ATP analogue 
inhibits sliding clamp formation!®, because it induces a closed ring- 
like structure that without hydrolysis is unable to interact with the 
mismatched DNA?”>. Similarly, a non-hydrolysable ATP analogue 
bound to EcMutL might produce a closed ring-like structure that 
is incapable of interacting with EcMutS or forming stable EcCMutL 
particles on the mismatched DNA (Fig. 3f). As a control, pre- 
incubation of EcMutL with ATP did not significantly change the 
frequency of EcMutS—EcMutL complexes or stable EcMutL particles 
on the mismatched DNA (Fig. 3g, left; compared to Fig. 2h, Fig. 3d, 
Methods). However, pre-incubation of EcMutL with non-hydrolysable 
adenylyl-imidodiphosphate (AMP-PNP) substantially reduced the 
frequency of ECMutS—EcMutL complexes and stable EcMutL particles 
(Fig. 3g, middle, Fig. 3h, top), leaving mostly EcMutS sliding clamps 
on the DNA (Fig. 3i, top). Because EcMutL(R95F) cannot bind 
ATP, abundant EcMutS—EcMutL(R95F) complexes were observed 
following pre-incubation of AMP-PNP, which then never progressed to 
oscillating EcMutS—EcMutL or stable EcMutL particles (Fig. 3g, right, 
Fig. 3h, bottom, and Fig. 3i, bottom). These results are consistent with 
the idea that an open conformation of EcMutL is required to interact 
with EcMutsS sliding clamps, which then binds ATP to form a second, 
exceedingly stable, ring-like clamp. 
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for EcMutS-EcMutL or EcMutS—EcMutL(R95F) complexes. Frequency 
and dissociation types are indicated within the pie chart (see Fig. 2a). 

f, Illustration of ATP and AMP-PNP pre-incubation studies. 

g, The frequency of EcMutS-EcMutL complex and fast-diffusing 

EcMutL under various EcMutL or EcMutL(R95F) pre-incubation 
conditions (mean + s.d.). h, Illustration showing AMP-PNP induced 
EcMutL ring-like clamp closure (top) that does not affect ECMutL(R95F) 
(bottom). i, Representative kymographs showing the absence of EcMutS- 
EcMutL complexes and the presence of ECMutS-EcMutL(R95F) complexes 
following pre-incubation of EcMutL or EcMutL(R95F) with AMP-PNP. 


The EcMutH dissociation constant (Ky) with hemimethylated Dam 
GATC DNA is at least threefold higher than its cellular concentration”®”’. 
EcMutL activation of the ECMutH endonuclease might overcome this 
binding problem'®”8, although the mechanics of this progression are 
unknown. We only detected AF647-labelled ECMutH (Extended Data 
Table 1, Extended Data Figs 2a, 5a) bound to the mismatched DNA 
in <50mM NaCl (Extended Data Fig. 5b-g). However, when both 
EcMutS and EcMutL were present, numerous EcMutH particles were 
observed on the mismatched DNA at physiological ionic concentration 
(Fig. 4a, Methods). The majority of ECMutH-AF647 (89% of particles) 
co-localized and displayed coordinated diffusion with EcMutL-Cy3 
(Fig. 4b, Extended Data Table 3). No EcMutH binding was observed 
when EcMutL(R95F) replaced wild-type EcMutL (Fig. 4a, c), suggesting 
that EcMutH only associates with an ATP-bound EcMutL sliding 
clamp. 

By substituting EcCMutS-AF555 for EcMutL-Cy3, we observed 
EcMutS-EcMutL-EcMutH complexes (Fig. 4d) that oscillated 
with free EcMutS and EcMutL-EcMutH, and displayed similar 
ionic-strength-independent diffusion properties to the ECMutS-EcMutL 
complex (Dgemuts-EcMutL-EcMutH = 0.005 + 0.003 1m? s~!; Extended 
Data Table 4, Extended Data Fig. 5h, middle, Extended Data Fig. 5i, 
Supplementary Video 5). As predicted, labelled ECMutL-EcMutH 
complexes (with unlabelled EcMutsS) oscillated with a mean diffusion 
coefficient that was skewed faster than the EC¢MutS—EcMutL complex 
(P = 0.001; D(ecMutS)-EeMutL-EcMutH =0.010+0.010 jum? s7}; Fig. Ae, 
Extended Data Fig. 5h, left, Extended Data Table 4). The lifetime 
of the EcMutL-EcMutH complex appeared to be independent 
of EcMutS association and increased the intrinsic EC€MutH DNA 
binding kinetics by at least 1,000-fold (Ton.ecmutL-EcMutH = 187 + 6 s; 
compare Fig. 4f to Extended Data Fig. 5c). The diffusion coeffi- 
cient of free ECMutL-EcMutH complex was intermediate between 
EcMutS-EcMutL complex and EcMutL sliding clamp (P < 0.0001; 
DycMutL-EcMutH = 0.041 + 0.034 jum? s~!; Extended Data Fig. 5h, right; 
Methods) and increased with ionic strength, indicating that the 
EcMutL-EcMutH complex moves with intermittent DNA contact?! 
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Figure 4 | ECMutH binds EcMutL sliding clamps. a, The frequency of 
EcMutH-AF647 on the mismatched DNA in the presence of other MMR 
components (mean + s.d.). b, Representative kymographs of EcMutH- 
AF647 co-localization and diffusion with EcMutL-Cy3 (and unlabelled 
EcMutsS) on a single mismatched DNA. c, Representative kymographs 

of EcMutL(R95F)-Cy3 (and unlabelled EcMutS) diffusion on a single 
mismatched DNA. Note the absence of ECMutH-AF647. d, Representative 
kymographs of ECMutH-AF647 co-localization and diffusion with 


(Extended Data Fig. 5j, Extended Data Table 4). Together these obser- 
vations suggest that the EcCMutS—EcMutL-EcMutH complex is uniquely 
capable of engaging in an efficient 1D diffusion-mediated DNA search 
for a hemimethylated Dam GATC site. 

Our studies demonstrate that E. coli MMR employs a cascade of 
stable ATP-bound sliding clamps to modulate 1D diffusion mechanics 
along the DNA. These observations show how mismatch recognition is 
effectively communicated to a distant MMR initiation site and provides 
additional support for the molecular switch/sliding clamp model!®”?. 
In this model, EcMutS mismatch recognition forms ATP-bound 
sliding clamps that randomly diffuse with intermittent DNA contact, 
establishing a stable platform capable of attracting other MMR com- 
ponents!0!°.1819.30 (Extended Data Fig. 6a). The association of EcCMutL 
with an EcMuts sliding clamp creates a previously unrecognized EcMutL 
sliding clamp that displays binding oscillation with EcMutS (Extended 
Data Fig. 6b). Both an EcMuts sliding clamp and ATP binding by EcCMutL 
are required to create these configurations (Extended Data Fig. 6b). 
Single-molecule images of quantum-dot-labelled ScMsh2-ScMsh6 and 
ScMlh1-ScPms1 on duplex”? and mismatched DNA curtains!! appeared 
unable to detect these conformations (Supplementary Note 1). 

We confirmed that the N terminus of EcMutL interacts with EcMutS 
sliding clamps using Forster resonance energy transfer”? (FRET; 
Extended Data Fig. 5k, Supplementary Video 6). These observations 
are consistent with a molecular progression where one of the EcMutL 
N-terminal homodimer domains interacts with EcMutS”?, creating a 
scaffold for the remaining EcMutL domains to wrap around the DNA 
in concert with natural helical motions during complex diffusion 
along the backbone; ultimately triggering ATP-dependent N-terminal 
domain dimerization and clamp formation‘. The fast diffusion and 
intermittent DNA contact of EcMutL sliding clamps makes ATP- 
induced collapse of the linker arms between the N- and C-terminal 
domains unlikely*’. Indeed, the EcMuts sliding clamps were capable 
of passing through the EcMutL sliding clamps in 12% of binding- 
oscillation encounters. We conclude that the fundamental role of ATP 
binding by EcMutL is to form an exceedingly stable sliding clamp on 
mismatched DNA. This observation might explain the persistence of 
ScMIh1-ScPms1 during MMR in vivo**. 
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EcMutS-AF555 (and unlabelled EcMutL) on a single mismatched DNA. 
Arrowheads indicate association of EcMutS with EcMutL-EcMutH. 

e, Representative kymographs of EcCMutH-AF647 co-localization and 
diffusion with EcMutL-Cy3 on a single mismatched DNA following the 
dissociation of EcMutS. Arrowheads indicate association of EcMutH with 
EcMutL sliding clamp. f, Dwell time distribution of the ECMutL-EcMutH 
complex (Ton-EcMutL-EcMutH} Mean + s.e.m.). 


The EcMutS-EcMutL complex remains in continuous DNA contact 
and capable of performing an efficient 1D diffusion-mediated DNA 
search”. The oscillation of EcCMutS—EcMutL with free EcMutS and 
EcMutL sliding clamps suggests that the complex may search ~2 kb 
of DNA, dissociate, move ~2 kb, then re-associate and begin a new 
~2kb search. This segmented search cycle may interrogate ~12 kb of 
DNA during the limiting lifetime of EcMutS sliding clamps (Extended 
Data Fig. 6a). Visualizing EcMutS-EcMutL diffusion clearly contrasts a 
trapped TaMutS—TaMutL’. It is likely that the TaMutS-TaMutL studies 
were influenced by an obligatory FRET constraint near the mismatch 
and/or the use of thermophilic proteins that become active at higher 
temperatures. 

EcMutH exclusively associated with the ATP-bound EcMutL either 
in the ECMutS-EcMutL search complex or as a free sliding clamp 
(Extended Data Fig. 6c). Importantly, the ECMutL-EcMutH complex 
diffuses with intermittent DNA contact, albeit ~20-fold slower than 
EcMutL. A simple explanation is that EcMutH increases the sporadic 
DNA interactions slowing EcMutL-EcMutH diffusion along the 
DNA. Nevertheless, the intermittent DNA contact implies that the 
EcMutL-EcMutH complex is probably not capable of performing an 
efficient DNA search. In contrast, the ECMutS—EcMutL-EcMutH com- 
plex remains in continuous contact with the DNA backbone allowing it 
to engage in a 1D diffusion-mediated search that may efficiently locate 
a distant hemimethylated Dam GATC site (Extended Data Fig. 6d). 

A mechanism that utilizes modulated stochastic diffusion along the 
DNA to initiate excision contrasts most historical MMR models®*!*, 
The notable conservation of MSH and MLH/PMS proteins implies that 
a similar mechanics is probably widely used for exploring the DNA 
backbone and activating distant MMR processes; it might also be 
used by MSH and MLH/PMS to connect damage sensing to the DNA 
damage response machinery’. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Buffers and experiment conditions. The single-molecule imaging Buffer A 
contains 20 mM Tris-HCl (pH 7.5), 5mM MgCh, 0.1mM DTT, 0.2 mg ml"! 
acetylated BSA (Promega), 0.0025% P-20 surfactant (GE healthcare) and 100 mM 
NaCl (unless stated otherwise). To minimize photoblinking and photobleaching, 
imaging buffer was supplemented with a photostability enhancing and oxygen 
scavenging cocktail containing saturated (~3 mM) Trolox and PCA/PCD oxygen 
scavenger system composed of PCA (1 mM) and PCD (0.42 U ml~!, rPCO, OYC 
Americas)*>. 

Construction of 17.3-kb \-phage-based DNA with a single mismatch. 
-phage DNA (3.2nM, Thermo Scientific) was ligated with the lambda mismatch 1 
oligonucleotide (800 nM; Extended Data Fig. 1, Extended Data Table 1) at room 
temperature (22°C) overnight. The unligated oligonucleotides were removed by 
a 100kDa Amicon filter (Millipore). The resulting DNA (3.2nM) was then ligated 
with lambda mismatch 2 (800 nM) at room temperature overnight, followed by the 
removal of unligated oligonucleotides as above. The resulting DNA (1.6nM) was 
cyclized at 18°C overnight and digested with Bsal at 37°C for 4h. The Bsal-treated 
DNA was then ligated with the lambda linkers (~2 1M, Extended Data Table 1) at 
18°C overnight. The DNA ligation products were separated on a 0.5% low melting 
agarose (Promega) gel; the 17.3-kb band was excised and treated with }-agarase 
(New England Biolabs) followed by isopropanol precipitation. The purified DNA 
was resuspended in TE buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA) and stored 
at —20°C until use. To methylate the mismatched DNA (Fig. 4, Extended Data 
Fig. 5h-j), 1 pg mismatched DNA was incubated with 801M S-adenosylmethionine 
and 8 U of Dam methyltransferase (New England Biolabs) at 37°C for 2h in an 
100-11 reaction, followed by inactivation of the enzyme at 65°C for 15 min. Control 
experiments demonstrate that the DNA becomes completely resistant to Mbol, 
indicating full methylation. 

Plasmid construction, MMR protein labelling and purification. EcMutS and 
EcMutL expression constructs have been previously described'®. The EcCMutL(R95F) 
point mutation was generated using the QuikChange site-directed mutagenesis 
kit (Stratagene). Hexa-histidine (hiss) and formylglycine-generating enzyme 
(FGE) recognition hexa-amino acid sequence (LCTPSR; ald¢) were introduced 
on to the C terminus of all MMR proteins with the exception of ECMutL(N-ter) 
and EcMutH, in which the tags were introduced on to the N terminus using a pre- 
viously described protocol'®. Two glycine residues separated the his, and ald, from 
one another and these tags were separated from the MMR proteins by two serine 
residues. The order of these tags relative to the MMR gene is indicated in Extended 
Data Table 2. The E. coli mutH gene was amplified by PCR (Extended Data Table 1), 
digested with NdeI and BamHI, and inserted into pET-29a (Novagen) bacterial 
expression plasmid. The resulting construct was amplified in E. coli XL10 gold 
(Stratagene) and verified by DNA sequencing. All the EcMutS and EcMutL 
proteins were expressed, labelled and purified as described previously'®. ECMutH 
was expressed, labelled and purified by a protocol similar to EcMutS. All of the 
hisg- and aldg-tagged MMR gene constructs used in these studies were shown to 
have a wild-type phenotype by complementation analysis in respective MMR- 
deficient E. coli strains except ECMutL(R95F), which was genetically a mutator 
indicating an MMR-deficiency as a result of a complete inability to bind ATP® 
(Extended Data Table 2). We used EcMutS(D835R,R840E) since the combined 
D835R and R840E substitution mutations eliminate the anomalous confounding 
effects of EcMutS protein tetramerization in vitro*! (Extended Data Table 2). The 
labelling efficiencies of ECMutS-AF647 dimer (45%), EcMutL-Cy3 dimer (33%), 
EcMutL(R95F)-Cy3 dimer (29%) and EcMutH-AF647 monomer (50%) were 
determined by spectrophotometry as described'® (Extended Data Table 1). 

Total internal reflection fluorescence microscopy and single-molecule studies. 
All the single-molecule data in this study were acquired on a custom-built prism- 
type total internal reflection fluorescence microscope based on the Olympus 
microscope body IX71. Fluorophores were excited using the laser lines (532 nm 
for green, 635 nm for red) in the single-molecule total internal reflection fluores- 
cence microscopy system. Image acquisition was performed using an EMCCD 
camera (Pro—EM Exelon512, Princeton Instruments) after splitting emissions by 
a Dual View optical setup (DV2, Photometrics). Micro-Manager image capture 
software was used to control the opening and closing of a shutter, which in turn 
controlled the laser excitation®. 

The 17.3-kb mismatched DNA (300 pM) in 30011 T50 buffer (20 mM Tris- 
HCl, pH 7.5, 50mM NaCl) was injected into the flow cell chamber and stretched 
by laminar flow (250,11 min~!). The stretched DNA was anchored onto a 
streptavidin-coated, PEG passivated quartz slide surface, and the unbound DNA 
was flushed by similar laminar flow. 

To detect transiently bound EcMutL or EcMutH on DNA (Extended Data Figs 3, 
5b-g), EcMutL or EcMutH (2-10nM) in imaging buffer was introduced into the 
flow cell chamber and protein-DNA interactions were monitored in real-time in 


the absence of flow at ambient temperature. The DNA was stained with Sytox 
Orange (250 nM, Invitrogen) or Syto 59 (700 nM, Invitrogen) after recording. 

To measure the interaction between EcMutS and EcMutL, EcMutS (10 nM) 
and/or EcMutL (20 nM) in imaging buffer plus 1mM ATP (unless stated other- 
wise) were introduced into the flow cell chamber and protein-DNA interactions 
were monitored. 

Fully methylated DNA was used in the EcCMutS-EcMutL-EcMutH experiments 
(Fig. 4). To measure the interaction between EcMutL and EcMutH, unlabelled 
EcMutS (5nM), EcMutL-Cy3 (10nM) and EcMutH-AF647 (5nM) in imaging 
buffer plus 1mM ATP were introduced. To measure the interaction between 
EcMutS and EcMutH, EcMutS—AF555 (10nM), unlabelled EcMutL (20nM) and 
EcMutH-AF647 (5 nM) in imaging buffer plus 1 mM ATP were introduced. 

To determine the EcMutL sliding clamp diffusion coefficient, EcMutS and 
EcMutL in imaging buffer (100 mM NaCl) plus 1 mM ATP were first co-injected. 
After 5 min the flow cell was flushed with imaging buffer containing 25-200 mM 
NaCl and 1mM ATP to remove unbound proteins and EcMutL sliding clamps 
were monitored. 

To determine the diffusion coefficient of the free ECMutL-EcMutH complex, 
unlabelled EcMutS (5 nM) and EcMutL-Cy3 (10nM) in imaging buffer (100 mM 
NaCl) plus 1 mM ATP were first co-injected. After 5 min the flow cell was flushed 
with imaging buffer containing 100 mM NaCl and 1 mM ATP to remove unbound 
proteins. ECMutH-AF647 (5 nM) in imaging buffer containing 50-150 mM NaCl 
and 1mM ATP was then introduced. 15 min after the wash diffusing ECMutH 
particles were monitored. 

AMP-PNP pre-incubation. EcMutL or EcMutL(R95F) (200 nM) was incubated 
with 2.5mM ATP or AMP-PNP in 100, Buffer A for 5 min. Ina separate reaction, 
EcMutS (11 nM) was incubated with 1.1mM ATP in 900 Buffer A for 30s. 
The two reactions were then mixed and introduced into the flow cell chamber 
immediately followed by the single-molecule imaging. 

Single-molecule data analysis. We found that 93% (52 out of 56) of the EcMutS 
sliding clamps displayed 1D diffusion along the mismatch DNA, whereas 7% 
(4 out of 56) remained on the mismatch (Extended Data Table 3). It is likely that 
the 7% EcMutS that remained on the mismatch reflects inactive proteins that do 
not respond to ATP binding!" and was excluded from further analysis. Molecules 
that diffused for at least 10s (300-1,500 ms frame rate) were included in the 
diffusion analysis. 

For studies involving EcMutS and EcMutL, fluorescent molecules in two 
channels were co-localized using a custom written MATLAB script. In these studies 
80% (43 out of 54) of the co-localized molecules displayed 1D diffusion, while 20% 
(11 out of 54) exhibited insignificant movement (Extended Data Table 3). Most 
of the stationary molecules appeared to be associated with the flow-cell surface 
and not subsequently stained DNA molecules (Extended Data Table 3). A similar 
distribution was observed for the EcMutS—EcMutL(R95F) complex (Extended 
Data Table 3). We excluded these immobile molecules from further analysis and 
only included molecules that diffused for at least for 10s (250-600 ms frame rate). 

In the presence of EcMutS and EcMutL, 89% (85 out of 95) of the ECMutH 
molecules displayed stable 1D diffusion along the DNA, while 11% (10 out of 95) 
exhibited insignificant movement (Extended Data Table 3). Most of the 
stationary molecules appeared to be associated with the flow-cell surface and 
not subsequently stained DNA molecules (Extended Data Table 3). We excluded 
these immobile molecules from further analysis and only included molecules that 
diffused for at least for 10s. 

EcMutL sliding clamps were easily distinguished from the EcMutS-EcMutL 
complex as a result of its extremely fast diffusion along the DNA (Fig. 2e, g). We 
included molecules with D > 0.05 1m? s"! for this analysis. 

Position determination on DNA. After the real-time measures, the 17.3-kb 
mismatched DNA was stained with Syto 59 (700 nM, Invitrogen) or Sytox Orange 
(250 nM, Invitrogen). The left (P,) and the right (Pr) end positions of the DNA 
were determined by plotting the fluorescent intensities along the length of the 
stained DNA (Extended Data Fig. 1d). Horizontal positions of diffusing particles 
(Pp) along the length of the DNA were tracked by DiaTrack 3.04 (Sydney, Australia), 
in which the particle intensities were fit to a two-dimensional Gaussian function to 
obtain their positions with sub-pixel resolution. The positions were then converted 
to lengths in bp by the following equation: 17,332 bp x (Pp-P,) / (Pr-Pi), where 
17,332 bp is the length of the mismatched DNA. A 1,000 bp (~2 pixels) binning 
size was used to construct the position histograms. 

Diffusion coefficient measurement and diffusion distance calculation. Particles 
were tracked using DiaTrack as described above to obtain single-molecule 
trajectories. Diffusion coefficients were calculated from the trajectories as 
described previously”. Briefly, the diffusion coefficient (D) was determined 
from the slope of a mean-square displacement (MSD) versus time plot using 
the equation MSD(t) =2 Dt, where t is the time interval. The first 10% of the 
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total measurement time was taken for point fitting. A minimum number 
of 50 frames were used to calculate the diffusion coefficients except for 
EcMutS—EcMutL complexes at 125mM or 150mM NaCl, in which 30 frames were 
used as the minimum because of the shortened lifetime. The MSD of each molecule 
was calculated by: 


N 
MSD(t) = = S> Geil) — x0)? 


i=1 


where t=nAT (n=1, 2, 3...., AT is the sampling rate), N= total number of 
diffusion events, x = horizontal position, and i= diffusion event. Diffusion 
coefficients were graphed in box plots (Figs. 1c, 2g, 3c, Extended Data Fig. 5) 
showing the mean (red line), median (indentation), the upper and the lower 
quartiles (box ends) and the outliers (whiskers). Diamonds indicate individual 
events. The average diffusion distance (d) was calculated using the equation: 
d=./2Dt, where t is the average lifetime of the particles on the DNA (Fig. 2d). 
Only EcMutS diffusion distance was reported during dissociation events of 
EcMutS—EcMutL complex. 

Lifetime measurement. A 100-ms or 200-ms frame rate was used to measure 
transient-bound EcMutL or EcMutH on DNA. A 1.5-s frame rate with 300-ms 
laser exposure time was used to minimize photo-bleaching during EcMutS 
sliding clamp or EcMutL sliding clamp lifetime measurements. A 1.0-s frame rate 
with 300-ms laser exposure time was used to minimize photo-bleaching during 
EcMutL-EcMutH complex lifetime measurements. A 600-ms frame rate with a 
300-ms laser exposure time was used to detect transient association-dissociation 
during EcMutS-EcMutL complex lifetime measurements. Kymographs were 
generated along the DNA by a kymograph plugin in Image] (J. Rietdorf and A. Seitz, 
EMBL Heidelberg). The dwell times of EcMuts sliding clamps, EcMutL particles or 
EcMutH particles on DNA were obtained from kymographs and binned to generate 
dwell time distributions, which were then fit to single exponential decay functions 
to determine the mean +s.e.m. For EcCMutS-EcMutL complexes, kymographs of 
both channels were merged and dwell times were obtained from the overlapping 
regions of the kymographs. Since we never observed EcCMutH DNA binding at 
physiological ionic conditions, the lifetime of the EcCMutL-EcMutH complex was 
obtained from kymographs of EcMutH channel alone. 

Binning method. All the binning methods in diffusion coefficient and lifetime 
analysis were described previously** and shown as below: 


Number of bins = [(N—1—-—1)-— round up] 
Max — min 
Number of bins — 0.5 


Size of bins = 


where: N= numbers of events; max = maximum value of events; min = minimum 
value of events; bin start = min; bin end = min + (bin size x number of bins). Bins 
distribution were then fit to standard curves*”. 

EcMutS-EcMutL complex, EcMutL sliding clamp and EcMutL-EcMutH 
complex frequency measurements. EcMutS-EcMutL complex, EcMutS- 
EcMutL(R95F) complex, EcMutL sliding clamp, and EcMutL(R95F) sliding clamp 
frequency measurements were performed at a 600-ms frame rate with a 300-ms 
laser exposure time. Following the infusion of proteins, single-molecule movies 
(10min) were recorded either immediately (Figs. 2h, 3g) or after 10 min incuba- 
tion to establish equilibrium (Fig. 3d). ECMutL or EcMutL(R95F) were tracked 
by DiaTrack and the trajectories were obtained as described above. EcMutL 
or EcMutL(R95F) with a minimum lifetime of 30s and a minimum diffusion 
coefficient of 0.1,1m? s~ were counted as the number of EcMutL sliding clamps 
(N, or Nirosz)). ECMutS and EcMutL (or EcMutL(R95F)) channels were merged 
and co-localized molecules with a minimum lifetime of 10s were counted as 
EcMutS-EcMutL or EcMutS—EcMutL(R95F) complexes (Ng, or Nsirosr))- 
Following the real-time single-molecule recording, the number of DNA molecules 
(Npna) was determined by Sytox Orange staining. The frequencies of EC¢MutL 
sliding clamps (F,), EcMutL(R95F) sliding clamps (Firosr)), ECMutS-EcMutL 
complexes (Fs) and EcMutS-EcMutL(R95F) complexes (Fs rosry) were calculated 
using the following equations that also included corrections for labelling efficien- 
cies of the proteins (the numbers in the denominator, Extended Data Table 1): 


N, Nirosr) 
Els = Ge ina Fyrosz) = ————— 
Npna X 0.33 Nona X 0.29 
Ne Nsi(RosF 
Fy = Su Fstrosk) = sda 


Nona X 0.33 x 0.45 Nona X 0.29 x 0.45 


EcMutL-EcMutH complex frequency measurement experiments were performed 
at a 600-ms frame rate with a 300-ms laser exposure time by using unlabelled 
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EcMutS (5nM), EcMutL-Cy3 (10 nM) and EcCMutH-AF647 (5nM). Following the 
infusion of proteins, single-molecule movies (10 min) were recorded immediately 
(Fig. 4a). EcCMutH was tracked using DiaTrack and the trajectories were obtained as 
described above. EcMutH particles with a minimum lifetime of 10 s were counted 
as the number of ECMutL-EcMutH complex (Nj). Following the real-time single- 
molecule recording, the number of DNA molecules (Npna) was determined by 
Syto 59 staining. The frequencies of EcMutL-EcMutH complex (Fi) were calcu- 
lated using the following equation that also included corrections for labelling effi- 
ciencies of the proteins (the numbers in the denominator, Extended Data Table 1): 


Nin 


FUy = —————_ 
Npna X 0.5 


All single-molecule studies were performed at least two separate times. 

Analysis of the distribution of ECMutS-EcMutL complex dissociation processes. 
Following the infusion of proteins, single-molecule movies (10 min) were recorded 
after a 10 min incubation to establish equilibrium (Figs. 2a, 3e). Kymographs of 
EcMutS—EcMutL complex or EcMutS-EcMutL(R95F) complex were obtained 
by merging the two channels. We included only the initial events for individual 
molecules in the analysis when multiple events between the same molecules were 
observed. 

MMR complementation in vivo. P. L. Foster (Indiana University) supplied the 
E. coli strains (all derivatives of MG1655 F- lambda- ilvG- rfb-50 rph-1). Mutation 
rates to rifampicin resistance were determined as previously described™, using 
at least seven independent colonies for each genotype. AmutS, AmutL and 
AmutH strains were co-transformed with respective MMR protein expression 
plasmids (Extended Data Table 2) and pTARA plasmid** (for T7 RNA polymerase 
expression, a gift from K. Matthews, Addgene plasmid 31491). Single colonies were 
picked and grown overnight in the presence of 50,.g ml! kanamycin, 35g ml! 
chloramphenicol, 0.2% Arabinose, and IPTG (0.05 mM for EcMutsS, 0.2 mM 
for EcMutL and 0.1 mM for EcMutH). As controls, single colonies of wild-type, 
AmutsS, AmutL and AmutH strains without plasmids were similarly grown 
overnight. Dilutions of overnight cultures were plated on LB-Agar plates with 
or without 100j1g mI“! rifampicin and allowed to grow overnight at 37°C. The 
colonies on LB with or without rifampicin were counted and the mutation rates 
were determined by fluctuation analysis®. 

EcMutH endonuclease activity assay. The d(GATC) endonuclease activity asso- 
ciated with EcMutH was assayed using 50 nM of the unmethylated FAM-labelled 
DNA substrate incubated with 41M EcMutH in a 50,11 reaction containing 20 mM 
Tris-HCl (pH 7.5), 5mM MgCl and 0.1mM DTT at 37°C for 1h. DNA was then 
denatured in 50% formamide at 95°C for 10 min and resolved on a 12% denaturing 
PAGE gel. 

Analysis of EcCMutS-EcMutL complex by smFRET. Particle positions were 
tracked with DiaTrack and particle trajectories were determined as previously 
described*®. The fluorescent intensities of the particles in both channels (Iazs555 and 
Iy¢647) were obtained from the trajectories using a custom written MATLAB script. 
A background correction was applied to all the intensities by using intensities in 
blank area. The apparent FRET efficiency was calculated as I p647/(Iarsss + Iare47)- 
Average free-energy barrier calculation for the ECMutS-EcMutL complex. 
To determine whether EcMutS-EcMutL complex diffusion includes rotational 
as well as translation diffusion we calculate the free-energy barrier of rotation 
along the DNA in aqueous solution (<)”*"°. These interactions can be described 
by diffusion on a rugged free-energy landscape by the following equation: 


; kgT 
[6mnRb? + 8rnR? + 6mNR(Roc)”] 


F(e) 


Where: 


kg = Boltzmann constant; T= temperature (Kelvin); 7)= viscosity of water; 
R= Stoke’s radius of the complex; b= (10B,)/(27), describes the effect of the 
helical pitch of DNA with regards to sliding; Bg= distance between two base pairs 
(0.34nm); Ro. = minimum distance between the protein centre of mass and the 
DNA axis. Here, Ro, is estimated to be zero, because the centre of mass of MutS or 
MutL sliding clamp is close to the DNA axis; and F(<) = exp(—(e/kgT)”), describes 
the fluctuating part of the potential function that obeys a Gaussian distribution”. 

The Stoke’s radii of an EcMutS dimer and an EcMutL dimer are 5nm and 9nm, 
respectively**°, There is no available Stoke’s radius for the ECMutS-EcMutL 
complex; however, it is expected to be larger than that of EcMutL. Thus, we 
estimated a Stoke's radius of 9 nm for the ECMutS—EcMutL complex. The calculated 
e in the equation using the experimentally determined diffusion coefficient for the 
EcMutS-EcMutL complex (0.004 + 0.002 1m? s~!) yields an energy landscape of 
1.74£0.2 kgT (s.d.). 
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Data availability. The data that support the findings of this study are available 


from the authors upon reasonable request. 
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Extended Data Figure 1 | The construction of mismatched DNA used 
in single-molecule total internal reflection fluorescence (smTIRF) 
microscopy. a, A schematic illustration for the construction of a 17.3-kb 
mismatched DNA. L or R (blue) indicates the orientation of the DNA 
relative to the L and R cos end of \-phage DNA. P (red) indicates 

the 5'-phosphate of the DNA. b, A schematic illustration of 17.3-kb 
mismatched DNA observation by prism-based smTIRF microscopy. 
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c, Representative mismatched DNA visualized by smTIRF microscopy 

in the absence of flow. The DNA was stained with Sytox Orange and a 

40 x 85m field of view is shown. d, A schematic illustration of the DNA 

length determination. e, The length distribution of the mismatched DNA 

observed by smTIRF microscopy. Gaussian fit of the data are shown along 
with the mean +s.d. 
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Extended Data Figure 2 | The fluorophore-labelled E. coli MMR Arg-95 (R95) is shown in magenta”*. c, An illustration of the kymograph 
proteins used in these studies and the formation of an EcMutS sliding construction of three separate EcMutS sliding clamps on a single 
clamp on DNA. a, Coomassie stained (top) and fluorescent (bottom) mismatched DNA. d, The distribution of diffusion coefficients for the 
SDS-PAGE gels of labelled MMR proteins. For gel source data, see EcMuts sliding clamp. The data were fit to a Gaussian with the mean + s.d. 
Supplementary Fig. 1. b, A crystal structure of the N-terminal domain e, The distribution of dwell times (mean + s.e.m.) for the EcMuts sliding 


of EcMutL bound to AMP-PNP (top) and magnification of the binding clamp. 
domain (bottom; PDB ID: 1B63). AMP-PNP is shown in green and 
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Extended Data Figure 3 | EcMutL does not bind DNA in physiological ionic conditions. a, b, Representative kymographs and dwell-times 
(mean + s.e.m.) for EcMutL binding to a mismatched DNA at various conditions. c-h, The distributions of dwell times (mean + s.e.m.) for ECMutL 
on mismatched DNA at different biochemical conditions as indicated. 
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Extended Data Figure 4 | Representative kymographs of EcMutS- drifting is negligible. c, Representative kymographs of oscillating EcMutS- 
EcMutL complex and EcMutL particles. a, Representative kymographs EcMutL complexes. Two channels (red, EcMutS; green, EcMutL) were 
showing the loading of EcMutL (green) on DNA by EcMuts sliding clamp merged. d, The distribution of the association times (mean + s.e.m.) for 
(red). b, Representative kymographs of a diffusing EcMutS-EcMutL EcMutS-EcMutL complexes during the oscillating phase. e, Representative 
complex (merged channels) and a non-diffusing EcMutS (red) in the kymographs of fast-diffusing EcMutL dissociation from EcMutS—EcMutL 
same field of view. The static kymograph of a non-diffusing EcMutS complexes. Two channels (red, EcMutS; green, EcMutL) were merged. 


indicates that the change in protein position caused by microscope stage 
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Extended Data Figure 5 | EcMutH lifetime on DNA and diffusion 
coefficient of ECMutS-EcMutL-EcMutH and/or EcMutL-EcMutH 
complex. a, A schematic illustration of ECMutH endonuclease assay 

(left) and a comparison of labelled or unlabelled EcMutH endonuclease 
activities (right). For gel source data, see Supplementary Fig. 1. 

b, c, Representative kymographs and dwell times (mean + s.e.m.) 

showing EcMutH on a single mismatched DNA under various ionic 

and magnesium conditions. d—g, The distributions of dwell times 

(mean + s.e.m.) for EcMutH on a single mismatched DNA at different 
biochemical conditions as indicated. h, Box plots showing D for oscillating 
(EcMutS)-EcMutLCy3-EcMutHA F647 ((S)-L-H) complex; the 
established ECMutSAF555-EcMutL-EcMutHAF647 complex (S-L-H); 
and free EcMutLCy3-EcMutHAF647 complex (L-H) at 100 mM NaCl. 

i, Box plots showing D for established EcMutS-EcMutL-EcMutH complex 
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at different NaCl concentrations. Two-sample t-test showed no significant 
difference between diffusion coefficients (P > 0.1). j, Box plots showing 

D for free EcMutL-EcMutH complex at different NaCl concentrations. 
Two-sample t-test showed significant differences between diffusion 
coefficients (P < 0.05). k, Top left, representative kymographs showing 
FRET between C-terminal AF555-labelled EcMutS and N-terminal 
AF647-labelled EcMutL (N-ter). Bottom, fluorescent intensities of 
EcMutS-AF555 (donor, green), ECMutL-AF647 (acceptor, red) and FRET 
(blue) between them when only the green laser was used for illumination. 
Right, a schematic illustration of kymographs. Experimental FRET 
measure (Egemuts-EcMutl, = 0.48 + 0.05; mean + s.d.) and theoretical 

FRET (Egemuts-EcMutL = 0.56) based on crosslink structure” appeared 
comparable. m= number of molecules throughout. 
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Extended Data Figure 6 | The interactions and kinetic properties EcMutH complex (see main text). Coil, 1D-diffusion search along the 
of the molecular switch/sliding clamp mechanism for E. coli MMR. backbone; dashed straight arrow, rotation-independent 1D-diffusion; 
a, Illustration of the kinetics and diffusion properties of EcMuts. black curved arrow, binding; red curved arrow, oscillating complex; 

b, Illustration of the kinetics and diffusion of EcMutS with dashed curved arrow, binding-dissociation; binding times and ATP («) 

are indicated. 


EcMuttL. c¢, Illustration of the kinetics and diffusion of EcMutS, 
EcMutL and EcMutH. d, Oscillation dynamics of the EcMutS—EcMutL- 
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Extended Data Table 1 | The oligonucleotides and fluorophore-labelled MMR proteins used in these studies 


Name* 


Sequences 


EcMutL(R95F) for 
EcMutL(R95F) rev 


EcMutL(N-ter) for 


EcMutL(N-ter) rev 
Lambda mismatch 1 
Lambda mismatch 2 

Lambda linker 1 
Lambda linker 2 
Lambda linker 3 
Lambda linker 4 


GCCATTATCAGCCTGGGCTTTTTCGGTGAGGCGCTGGCGAGTATC 
GATACTCGCCAGCGCCTCACCGAAAAAGCCCAGGCTGATAATGGC 


GGAGGCCATATGCTCTGCACACCGTCGCGTGGAGGGCATCACCATCACCATCATTCATCAATGCCAA 
TTCAGGTCTTACC 


TCCGCCCTCGAGTTACTCATCTTTCAGGGCTTTTATCG 
Phos-GGGCGGCGACCTGCTTAGGATCATCGAGGATCGACGTCGGTGCAATTCAGCGGACTAGTCC 
AGGTCGCCGCCCGGACTAGTCCGCTGAATTGCACCGACGTTGATCCTCGATGATCCTAAGC 
Phos-TTCTTGGAGTCCCCTGCAGCGATTAATACGACTAGAGC 
Biotin-GCTCTAGTCGTATTAATCGCTGCAGGGGACTCCA 
Phos-CGCTTAGTGCTATGATGCGTTCGATCACTCCATGTGATC 
Biotin-GATCACATGGAGTGATCGAACGCATCATAGCACTA 


EcMutH for1 AGGGCATCACCATCACCATCATTCATCAATGTCCCAACCTCGCCCA 

EcMutH for2 GGAGGCCATATGCTCTGCACACCGTCGCGTGGAGGGCATCACCATCACCA 

EcMutH rev GGAGGCGGATCCCTACTGGATCAGAAAATGAC 

Proteint Labelled monomer Unlabelled dimer i fs oto py dehe aol 

AF647 labelled EcMutS 26% 55% 38% 7% 

Cy3 labelled EcMutL 18% 67% 30% 3% 

Cy3 labelled EcMutlL(R95F) 16% 71% 26% 3% 

AF647 labelled EcMutH 50% N/A N/A N/A 


*Oligonucleotides used in these studies. 
tFluorophore-labelling efficiencies of MMR proteins used in these studies. 
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Extended Data Table 2 | Cellular complementation and mutation rates of MMR proteins 


MMR proteins in 


Strain expression Genotype Mutation ratest Range 
plasmids’ (per 10° cells) (per 10° cells) 

MG 1655 2 wild type 5.2 1.8-9.0 
MG 1655 AmutS = mutS” 522.0 334.9-860.8 
MG 1655 AmutS  Ra4oEy fice ald aaa 12 0.4-12.9 
MG 1655 AmutL 7 mutl- 1004.9 572.2-3481.1 
MG 1655 AmutL — EcMutL-hise-alde MutL* 6.3 0.4-9.5 
MG 1655 AmutL maa ail MutL (R95F) 702.3 29.5-1981.3 
MG 1655 AmutL — alde-hise-EcMutL MutL* 0.8 0.4-1.2 
MG 1655 AmutH : mutH” 100.2 77.8-371.9 
MG 1655 AmutH — aldg-hise-EcMutH MutH* 3.3 1.2.5.7 


*Amino acid substitutions are indicated in parenthesis; the arrangement of the hexa-histidine (hisg) and formylglycine-generating 


enzyme (aldg) recognition sequence relative to the N terminus (beginning) or C terminus (end) of the MMR gene. 
+The median mutation rates are reported. 
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Extended Data Table 3 | The frequency of particle varieties observed by single-molecule analysis 


i i : Co-localize with . Number of 
Pcs  BIRNSION sealed DNA Ratio aaa ee 
€ + 88% 
93% 
EcMuts only + - 5% 56 
+ * 7% 
+ + 69% 
80% 
* 9 
EcMutS-EcMutL 11% m 
complex 7 “ ai 
- a 16% 
= + 76% 
85% 
id 0, 
EcMutS-EcMutL i - 9% ‘ 
(R95F) complex - F 
- = 11% 
- * 80% 
89% 
EcMutL-EcMutH a - 9% 7 
complex 7 " oat 
- 5 10% 


*The DNA was probably broken during staining. 
timmobile particles bound near the mismatch. 
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Extended Data Table 4 | Diffusion coefficients 


Protein conformation [NaCl](mM) Mean+S.D. (10° um? s“) 


EcMutS sliding clamp 100 43 +16 
50 442 
75 443 
EcMutS-EcMutL 100 442 
complex ~ 
125 543 
150 543 
25 453 + 311 
100 888 + 393 
Fast diffusing EcMutL 
150 1716 + 623 
200 2220 + 778 
EcMutS-EcMutL(R95F) 100 342 
complex 
(EcMutS )-EcMutL- + 
EcMutH" 0 eae 
50 442 
EcMutS-EcMutL- 
EcMutH? oo ae 
100 543 
50 24 + 23 
Free pee dee 400 4a Sal 
complex’ 
150 118+ 151 


*Oscillating complexes potentially containing unlabelled EcMutS (EcMutS) with co-localized 
EcMutL—Cy3 and EcMutH-AF647. 

tA two-sample t-test suggests a significant difference (P=0.001) in diffusion coefficient between the 
EcMutS-EcMutL complex and (EcMutS)-EcMutL-EcMutH at 100 mM NaCl. 

+Complexes containing co-localized EcMutS—AF555, unlabelled EcMutL and EcMutH-AF647. 

§See Methods. 

\|A two-sample t-test suggests a significant difference (P< 0.0001) in diffusion coefficient between the 
free EcCMutL-EcMutH complex and (EcMutS)-EcMutL-EcMutH at 100 mM NaCl. 
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Genetic and mechanistic diversity of piRNA 3’-end 


formation 


Rippei Hayashi!*, Jakob Schnabl'*, Dominik Handler!, Fabio Mohnt, Stefan L. Ameres! & Julius Brennecke! 


Small regulatory RNAs guide Argonaute (Ago) proteins in a 
sequence-specific manner to their targets and therefore have 
important roles in eukaryotic gene silencing’. Of the three small 
RNA classes, microRNAs and short interfering RNAs are processed 
from double-stranded precursors into defined 21- to 23-mers by 
Dicer, an endoribonuclease with intrinsic ruler function. PIWI- 
interacting RNAs (piRNAs)—the 22-30-nt-long guides for PIWI- 
clade Ago proteins that silence transposons in animal gonads— 
are generated independently of Dicer from single-stranded 
precursors”*. piRNA 5’ ends are defined either by Zucchini, the 
Drosophila homologue of mitoPLD—a mitochondria-anchored 
endonuclease*®, or by piRNA-guided target cleavage®’. Formation 
of piRNA 3’ ends is poorly understood. Here we report that two 
genetically and mechanistically distinct pathways generate piRNA 
3’ ends in Drosophila. The initiating nucleases are either Zucchini or 
the PIWI-clade proteins Aubergine (Aub) or Ago3. While Zucchini- 
mediated cleavages directly define mature piRNA 3’ ends®?, Aub/ 
Ago3-mediated cleavages liberate pre-piRNAs that require extensive 
resection by the 3/-to-5’ exoribonuclease Nibbler (Drosophila 
homologue of Mut-7)!*!%, The relative activity of these two pathways 
dictates the extent to which piRNAs are directed to cytoplasmic or 
nuclear PIWI-clade proteins and thereby sets the balance between 
post-transcriptional and transcriptional silencing. Notably, loss of 
both Zucchini and Nibbler reveals a minimal, Argonaute-driven 
small RNA biogenesis pathway in which piRNA 5’ and 3’ ends are 
directly produced by closely spaced Aub/Ago3-mediated cleavage 
events. Our data reveal a coherent model for piRNA biogenesis, and 
should aid the mechanistic dissection of the processes that govern 
piRNA 3’-end formation. 

piRNA biogenesis is initiated by endonucleolytic definition of 
piRNA 5’ ends (Fig. 1a). Based on the nuclease involved, this defines 
primary (processed by Zucchini)** and secondary (processed by Aub/ 
Ago3)®” piRNAs. Attributes of these two endo-cleavages are a5’ uridine 
signature for primary piRNAs, and a 10-nt 5’ sense/antisense offset for 
secondary piRNAs (ping-pong signature). Following 5’-end cleavage, 
piRNA intermediates are anchored with their 5’ ends in PIWI proteins 
before their 3’ ends are matured!*> (we refer to these as pre-piRNAS). 
Zucchini also liberates 3’ ends of primary and secondary piRNAs*®”. 
In zucchini mutants, however, Aub/Ago3-bound secondary piRNAs 
are still abundant!®, indicative of alternative 3’-end formation path- 
ways (Extended Data Fig. 1a). The 3’ ends of these piRNAs lack signs 
for endonucleolytic processing, such as a coupling signature stem- 
ming from Zucchini-mediated, phased piRNA biogenesis®”, ora 
3//5' ping-pong signature indicative of slicer-mediated 3’-end formation 
(Extended Data Fig. 1b). This supports the hypothesis that Zucchini- 
independent piRNA 3’ biogenesis involves exonucleolytic resection of 
pre-piRNAs that have been generated by Aub/Ago3 (Fig. 1a)’. 

We identified piRNA 5’ species (piRNAs with the same 5’ end) 
that—besides piRNAs in the 23-29 nt range—exhibit one abnormally 


long isoform that extends to the cleavage position of a complementary 
piRNA (Fig. 1b). These isoforms are also found in libraries from 
immuno-purified PIWI proteins (Fig. 1b), indicating that they 
represent Aub/Ago3-loaded pre-piRNAs whose 3’ ends have been 
formed by slicing and await trimming. Consistent with this, the long 
isoforms lack 2’-O-methylation at their 3’ ends (Fig. 1b). 

To identify the 3’ exonuclease involved, we used a piRNA 
biogenesis reporter that recapitulates Zucchini-independent piRNA 
3'-end formation®; expression of a reporter with two target sites 
for cellular piRNAs forces the generation of responder piRNAs in 
Zucchini-depleted ovaries (Fig. 1c, d). We combined this reporter 
with a double-shRNA expression cassette to co-deplete Zucchini and 
a gene of interest (Extended Data Fig. 2a-c). A strong candidate for 
the exonuclease is the PARN-like nuclease PNLDC1, which trims 
pre-piRNAs in silkworm!’. As PARN-family nucleases are absent 
in Drosophila, we tested instead the mitochondria-anchored Tudor/ 
KH-domain protein Papi, an essential PNLDC1 co-factor in silkworm 
(Extended Data Fig. 2d, e)!”'®. Co-depletion of Zucchini and Papi does 
not impair piRNA generation from two independent reporters (Fig. 1d, 
Extended Data Fig. 2b, f), and global piRNA levels are comparable 
between Zucchini- versus Zucchini and Papi co-depleted ovaries 
(Extended Data Fig. 2g). Notably, Piwi-bound piRNAs increase by 
approximately 0.5 nt in length in papi mutants (Fig. le, Extended 
Data Fig. 3a, j)®. As 3’ ends of Piwi-bound piRNAs are generated pre- 
dominantly by Zucchini®’, we conclude that Papi-assisted piRNA 
trimming—if conserved in flies—occurs downstream of Zucchini, 
consistent with its role in mouse and silkworm®”!”"'8, 

We next tested the 3/-to-5’ exoribonuclease Nibbler/Mut-7, which 
trims some micro RNAs (miRNAs) after their loading into Agol'*!), and 
which has been reported to modulate piRNA lengths'”'. Co-depletion 
of Zucchini and Nibbler (Extended Data Fig. 2c) ablates piRNA produc- 
tion from both reporters despite trigger piRNAs remaining abundant 
and silencing-competent (Fig. 1d, Extended Data Fig. 2f, h). 

Consistent with Nibbler acting on slicer-generated pre-piRNAs, it is 
enriched in perinuclear nuage together with Aub/Ago3, while Papi co- 
localizes with Zucchini at mitochondria (Fig. 1f, Extended Data Fig. 2d, e). 
In aubergine mutants, Nibbler’s nuage localization is reduced, yet Nibbler 
does not enrich in Krimper foci where unloaded Ago3 accumulates 
(Fig. 1f)'®!?°. Nibbler’s co-localization with Aub/Ago3 therefore prob- 
ably depends on these factors being loaded with pre-piRNAs. We did 
not detect robust interactions between Nibbler and Aub/Ago3 by co- 
immunoprecipitation (weak interactions between Nibbler and Piwi were 
detected’), hinting at a transient interaction (Extended Data Fig. 2i, j). 

To characterize Nibbler’s role in piRNA biogenesis we generated flies 
that express no detectable Nibbler protein (Extended Data Fig. 3b, c). 
As reported!*3 , nibbler mutants are viable and fertile, but defective in 
mir-34 maturation (Extended Data Fig. 3d). Also as reported, localiza- 
tion and abundance of PIWI proteins, overall piRNA levels, and trans- 
poson silencing are not affected (Extended Data Fig. 3e—h). Average 
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Figure 1 | The 3’-to-5’ exonuclease Nibbler matures piRNA 3’ ends 
from slicer-cleaved pre-piRNAs. a, Schematic illustration of piRNA 5’ 
and 3’ biogenesis. b, Northern blot against individual piRNA 5’ species 
(mature piRNAs: 23-29 nt) detects a 34-nt pre-piRNA (blue arrowhead). 
3’-end methylation probed by 3-elimination (miR-8 serves as control). 

To the right, sequencing counts of the corresponding piRNAs (normalized 
to 1 million miRNA reads; p.p.m.) from total small RNAs or from an Aub 
immunoprecipitation are shown. c, Schematic of the dual-site piRNA 


piRNA length, however, is mildly increased (Extended Data Fig. 3i; our 
sequencing libraries span 18-40 nt, Extended Data Fig. 4). Notably, this 
originates primarily from Ago3-bound piRNAs, which increase >1 nt 
in length (Fig. le). This supports a specific role for Nibbler in resecting 
Aub/Ago3-generated pre-piRNAs. Indeed, somatic Piwi-bound 
piRNAs, whose 3’ ends are generated by Zucchini, show no length 
change in nibbler mutants, in contrast to papi mutants (Extended Data 
Fig. 3j)!*19. These results indicate that Nibbler does not fine-tune 
piRNA length as proposed'', but instead represents the central 
exonuclease of a distinct piRNA 3’-end pathway that resects slicer- 
generated pre-piRNAs to mature piRNAs. 

If Zucchini endonuclease and Nibbler exonuclease act in separate 
pathways to generate piRNA populations with similar overall length, 
the 3’ profiles of piRNA 5’ species should differ in single-mutant 
ovaries. We inspected individual Aub/Ago3-bound piRNA 5’ species 
by northern blot analysis and sequencing (Fig. 2a, b). While piRNAs 
in Zucchini-depleted ovaries display a broad length profile (consistent 
with exonucleolytic resection), piRNAs in nibbler mutants display 
discrete length patterns with major isoforms typically being followed 
by uridine (downstream-U signature), a hallmark of Zucchini cleavages 
(Fig. 2a, b)*”. 

To generalize these findings, we determined the downstream-U 
signature and the 3’-end precision index for thousands of piRNA 5/ 
species bound to Piwi/Aub/Ago3. This allows several conclusions. (1) In 
agreement with the two-pathway model, the downstream-U signature 
increases in nibbler mutant ovaries, yet is ablated in Zucchini-depleted 
ovaries (Fig. 2c; Piwi- piRNAs are lost in the absence of Zucchini). 
(2) In wild-type ovaries, the downstream-U signature is strong for 
Piwi-bound piRNAs, intermediate for Aub, and very weak for Ago3 
(Fig. 2c), indicating that Zucchini acts predominantly on Piwi and Aub. 


biogenesis reporter. UTR, untranslated region. d, Levels of small RNAs 
(p.p.m.) mapping to the biogenesis reporter (5’ ends only) in indicated 
genetic backgrounds. e, Length profiles of TE-mapping small RNAs 
isolated from Piwi/Aub/Ago3-immuno-precipitates (genetic background 
indicated; P-values calculated with two-sided t-test). f, Confocal images 
showing localization of GFP-Nibbler, Aub and Ago3 in w!”"8 or aubergine 
mutant egg chambers (scale bars, 5 \1m; individual channels as inverted 
grey scale images). 


(3) The downstream-U signature correlates with the 3’-end precision 
index of piRNA populations (Fig. 2c). (4) In nibbler mutants, the 3’-end 
precision index increases for Piwi-, Aub-, and Ago3-bound piRNAs, 
indicating that Nibbler acts on all three PIWI proteins (Extended Data 
Fig. 5a). (5) Characteristics of piRNA 5’ ends do not correlate with the 
3’-end precision index (Extended Data Fig. 5b), arguing that 5’-end 
generation does not dictate the mode of 3’-end formation. 

In agreement with Nibbler and Zucchini acting in parallel pathways, 
the length profiles of wild-type piRNAs appear to be a composite of 
the two respective single-pathway profiles (Fig. 2a, b). Inspired by this 
observation, we computed the relative contribution of Nibbler and 
Zucchini for Aub/Ago3-bound piRNAs. For each of the approximately 
300 analysable piRNA 5’ species, we determined the Zucchini/Nibbler 
contribution at which the combined length profile best mimics the 
wild-type profile (Fig. 2d). For both, Aub- and Ago3-bound species the 
wild-type profiles can be accurately modelled from the single-pathway 
profiles (Extended Data Fig. 5c). This results in an approximate median 
70:30 dominance of Zucchini over Nibbler for Aub-bound piRNAs and 
an opposite ratio for Ago3-bound piRNAs (Fig. 2e), in agreement with 
the 3'-end characteristics of the respective piRNA populations in wild- 
type ovaries (Fig. 2c). Our data demonstrate that two parallel pathways 
with varying contributions form 3’ ends of Aub/Ago3-bound piRNAs: 
Zucchini generates most Aub-bound piRNAs, while Nibbler generates 
most Ago3-bound piRNAs. 

Zucchini-mediated piRNA 3’-end formation results in processing 
of the downstream precursor RNA into phased piRNAs bound to 
Piwi (referred to as triggering)*”. 3’-end formation by Nibbler instead 
prevents triggering owing to degradation of the downstream precursor 
(Fig. 3a). As Zucchini compensates for 3’-end formation in the absence 
of Nibbler, triggering levels should increase in nibbler mutants. Indeed, 
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Figure 2 | Two genetically independent pathways generate piRNA 

3’ ends. a, b, Northern blot analysis (loading control: miR-8) and 
sequencing counts for Ago3- (a) or Aub- (b) enriched piRNAs in indicated 
genetic backgrounds. Bar charts display fraction of reads with indicated 
length. Respective RNA sequences are shown. c, Sequence logos display 
nucleotide bias around dominant 3’ ends of Piwi/Aub/Ago3-bound 
piRNAs isolated from ovaries of indicated genetic background. Bar plots 


the occurrence of Piwi-bound piRNA 5/ ends immediately down- 
stream of Aub/Ago3-bound piRNA 3’ ends increases in nibbler mutant 
compared to wild-type ovaries (Fig. 3b). As expected, this increase 
is more pronounced for Ago3/Piwi linkages compared to Aub/Piwi 
linkages. We conclude that in wild-type ovaries, downstream slicing 
and subsequent Nibbler-catalysed pre-piRNA resection limits the 
extent of triggering, especially for Ago3-bound pre-piRNAs. 

In agreement with elevated triggering, the levels of Piwi-bound 
piRNAs increase at the expense of ping-pong piRNAs in nibbler 
mutants (Fig. 3c). It is unclear why Aub-bound, but not Ago3-bound 
piRNAs are reduced in the absence of Nibbler. Possibly this is due 
to Ago3 incorporating abnormally high levels of antisense piRNAs 
(Extended Data Fig. 6a, b). As a consequence of the shifts in piRNA 
populations, ping-pong signatures for nearly all transposable elements 
(TEs) decrease (Extended Data Fig. 6c). This indicates a competitive 
relationship between two piRNA 3/-end formation pathways with 
consequences for piRNA loading into PIWI proteins: Nibbler limits 
the extent at which slicer-induced piRNA biogenesis propagates into 
Zucchini-mediated downstream piRNA biogenesis that fuels nuclear 
Piwi. By contrast, Zucchini consumes piRNA precursors, reducing their 
participation in ping-pong during post-transcriptional regulation. 

These findings prompted us to re-examine the long-standing ques- 
tion of why secondary piRNA populations from some TEs remain 


590 | NATURE | VOL 539 | 24 NOVEMBER 2016 


display the nucleotide composition at the position following dominant 

3’ ends (piRNAs are split into ten equally sized bins sorted for their 
precision index). d, Shown is how the Zucchini/Nibbler contribution to 
3’-end formation of individual piRNA 5’ species was determined. e, Shown 
are the Zucchini/Nibbler contributions for 3’-end formation for Aub- and 
Ago3-bound piRNAs (left, as stacked bar diagram; right, as box plot; 
Tukey definition; ***P < 0.001 after Wilcoxon rank-sum test). 


abundant in Zucchini-depleted ovaries (‘robust TEs’; for example, Doc), 
while others collapse (‘sensitive TEs’; for example, J-element; Extended 
Data Fig. 6d)'°. We reasoned that only TEs with a minimal abundance 
of Nibbler substrates (reflected by abundant ping-pong piRNAs) could 
maintain piRNA biogenesis in the absence of Zucchini. That is because 
Zucchini generates piRNA 3’ ends independent of additional precursor 
cleavages, while Nibbler-mediated 3’-end formation requires a second 
piRNA-guided cleavage event close by (Fig. 3a). Indeed, ping-pong 
piRNA levels are substantially higher for robust compared to sensitive 
TEs in wild-type ovaries (Extended Data Fig. 6d). TEs with ping-pong 
piRNAs below a threshold level therefore cannot compensate for 
Zucchini loss, as the production of Nibbler substrates is too inefficient. 
As ping-pong is a feed-forward loop, this results in the collapse of 
piRNA biogenesis. 

As the most direct test for two separate piRNA 3’-end pathways, 
we co-depleted Zucchini and Nibbler. As expected from the piRNA 
biogenesis reporter experiments (Fig. 1d), this results in piRNA loss 
for some TEs, which generate abundant piRNAs in Zucchini-depleted 
ovaries (Extended Data Figs la and 7a). Surprisingly, however, piRNAs 
that map to several other TEs are only mildly affected (Extended Data 
Fig. 7a, b), total germline piRNA levels are reduced less than twofold 
compared to Zucchini-depleted ovaries (Extended Data Fig. 7c), and 
TE derepression is similar to Zucchini-depleted ovaries (Extended Data 
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Figure 3 | Nibbler and Zucchini set the balance between primary and 
secondary piRNA biogenesis. a, Schematic of the two-pathway model for 
piRNA 3/-end formation. b, c, Scatter plots based on piRNAs mapping to 
germline-dominant TEs (n = 63). b, Plotted are Z-scores of piRNA 3//5/ 
coupling in w!!!8 versus nibbler~/~ ovaries (orange, Aub-bound antisense 
piRNA 3’ ends to Piwi-bound piRNA 5’ ends; blue, Ago3-bound sense 
piRNA 3’ ends to Piwi-bound piRNA 5’ ends). c, Normalized contributions 
of Piwi/Aub/Ago3-bound piRNAs to total TE piRNA profiles in w!!/8 
versus nibbler~/~ ovaries. Stacked bar plots show sum for all analysed TEs. 


Fig. 7d). The remaining germline piRNAs in Zucchini and Nibbler 
co-depleted ovaries populate Aub/Ago3 (Piwi is largely lost; Extended 
Data Fig. 8a) and exhibit less-defined size profiles, with many piRNAs 
being abnormally long or short (Extended Data Fig. 8b). 

Based on a strong ping-pong signature, Aub/Ago3-mediated 
slicing defines the 5’ ends of Zucchini/Nibbler-independent piRNAs 
(Extended Data Fig. 8c). A closer look at their mappings provides an 
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explanation of how their 3’ ends are generated (Fig. 4a). In double- 
depleted ovaries, novel ping-pong pairs emerge between two distantly 
spaced ping-pong pairs, thereby reducing the cleavage intervals to 
around 20-30nt. Also, piRNA 3’ ends change from a bell-shaped profile 
(consistent with Nibbler-mediated exo-resection) to discrete profiles 
where a single dominating 3’ end precedes the 5’ end of a flanking 
piRNA by 1 nt (Fig. 4a). This suggests that two slicer events, spaced by 
one piRNA length, directly generate 5’ and 3’ ends of piRNAs. To test 
this prediction we turned to the piRNA biogenesis reporter with two 
cleavage sites spaced by 52 nt that is incompatible with piRNA biogene- 
sis in Zucchini and Nibbler co-depleted ovaries (Fig. 4b). Introducing a 
third central target site re-installs biogenesis for two responder piRNAs, 
whose 3’ ends map precisely to the downstream slicer sites (Fig. 4b). 

We systematically analysed Zucchini/Nibbler-independent piRNAs 
for two characteristic signatures, namely 3’/5’ coupling (nucleotide- 
precision phasing), and 3/5’ ping-pong (presence of complementary 
piRNA 5’ ends 10 nt downstream of piRNA 3’ ends). Both signatures— 
while absent in Zucchini-depleted ovaries—are pronounced in 
double-depleted ovaries, indicative of tightly spaced ping-pong pairs 
(Fig. 4c). When piRNAs are grouped into length cohorts, coupling of 
flanking piRNAs is apparent for all size classes (Fig. 4d). No piRNA 
coupling is observed in Zucchini-depleted ovaries, as here Nibbler 
allows ping-pong pairs to be spaced in a larger window (Fig. 4e, 
Extended Data Fig. 8d). 

Zucchini/Nibbler-independent piRNAs bound to Aub or Ago3 
retain their respective nucleotide bias of 1U and 10A (Extended Data 
Fig. 8e). Given the precise piRNA coupling, this explains why slicer/ 
slicer-generated piRNAs display downstream-1U and downstream-10A 
signatures (Extended Data Fig. 8e). As uridine residues are not spaced 
in pre-fixed patterns, this requires flexibility on the Argonaute-side to 
accommodate piRNAs with different lengths. Indeed, while approx- 
imately 80% of piRNA species in Zucchini-depleted ovaries are 
23-27 nt, this length accounts for only approximately 50% in Zucchini 
and Nibbler co-depleted ovaries (Fig. 4d, e, Extended Data Fig. 8b). 
Nucleotide-resolution northern blots confirm the existence of piRNAs 
as short as 21 nt and as long as 32 nt (Fig. 4f, Extended Data Fig. 9a). 
While these piRNAs are similarly abundant as the corresponding 
3'/5’ ping-pong Figure 4 | An Argonaute-only pathway 
generates piRNAs in the absence of Zucchini 
and Nibbler. a, Normalized 5/ and 3’ end counts 
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species in wild-type ovaries, their length is restricted to essentially a 
single isoform. 

When examining slicer/slicer-generated piRNAs for their 
2'-O-methylation status, a hallmark of mature piRNA 3’ ends*)”?, 
even 32-nt-long piRNAs are methylated (Fig. 4f). This extends to all 
piRNA sizes (Extended Data Fig. 9b), indicating that 2’-O-methylation 
can occur independently of Zucchini- or Nibbler-mediated 3’-end 
formation. As pre-piRNAs with similar length are not methylated in 
wild-type ovaries (Fig. 1b), Nibbler probably acts faster than Hen1. 

Together with recent findings®**’’, our data provide a blueprint for 
piRNA 3/-end formation. Two separate exonucleolytic pathways— 
initiated by endonucleolytic cleavages—are dedicated to pre-piRNA 
trimming: the Papi/PNLDC1 pathway’”'’, and the Nibbler/Mut-7 
pathway. This is probably an ancient pathway architecture as—similar 
to ping-pong**—all involved nucleases (Zucchini, PNDLC1, and 
Nibbler/Mut-7) are conserved from sponges to mammals (Extended 
Data Fig. 10). There are, however, interesting exceptions. (1) Nematodes 
have lost Zucchini and an entirely different small RNA biogenesis 
system fuels their PIWI proteins”. In C. elegans, PARN trims PIWI- 
bound small RNAs”*, while Nibbler/Mut-7 is required for 22G siRNA 
biogenesis”®. (2) While PARN or PNDLC1 have been sporadically 
lost in several lineages (for example, fish), only flies have lost both 
enzymes. We postulate that this central pre-piRNA exonuclease is 
dispensable in flies as here Zucchini directly forms mature piRNA 3’ 
ends. (3) Only two groups have lost Nibbler: all Anopheles species, and 
several mammals including rodents. Whether this indicates a lesser 
importance of efficient ping-pong in these species is currently unclear. 

The balance between the two 3’-end-generating pathways defines the 
extent to which precursors are processed into Piwi-bound versus Aub/ 
Ago3-bound piRNAs. Ultimately, this determines the ratio between 
transcriptional (Piwi) and post-transcriptional silencing (Aub/Ago3). 
Aub-bound piRNAs, which are antisense-biased, are more abundant 
than Ago3-bound sense piRNAs. Piwi-bound piRNAs, which are 
generated in response to Aub/Ago3-initiated triggering, are, however, 
antisense-biased’””*. Ago3-generated cleavage intermediates must 
therefore be transferred more efficiently to mitochondria for Zucchini- 
mediated 3’-end formation than Aub- generated intermediates®’. Also, 
Nibbler matures Ago3-bound pre-piRNAs probably more frequently 
compared to Aub-bound pre-piRNAs, because Aub-bound piRNAs are 
more abundant than Ago3-bound piRNAs. Consequently, two closely 
spaced cleavages will be more frequent downstream of Aub than of 
Ago3. 

In the absence of Zucchini and Nibbler, PIWI proteins accommodate 
2'-O-methylated piRNAs ranging in length from 20-34 nt. We speculate 
that the slicer-only pathway potentially represents an ancient small 
RNA-generating unit, onto which dedicated endo- and exonucleases 
were added during evolution in order to efficiently generate piRNAs 
of optimal length. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 
Fly husbandry and strains. Flies were kept at 25°C. Short hairpin RNA (shRNA) 
fly lines were crossed to the maternal triple driver (MTD)-GAL4 line (#31777; 
Bloomington stock centre) to drive expression of the shRNA in the germ line. 
shRNA constructs for double depletion of Zucchini-Nibbler and Zucchini-Papi 
were inserted into attP27?. TRiP.GLO0111 line was used (#35227; Bloomington) 
for depleting Zucchini alone. GFP reporter constructs and GFP-tagged BAC 
rescue constructs were inserted into the attP40 landing site. A nibbler and a papi 
allele with frame-shift mutations were generated by CRISPR/Cas9 using guide 
RNAs TGACGCCACCTTGGACGCAA and CGAGCCGCCTTAACCGCATC, 
respectively, as previously described*’. N-terminally Flag-tagged nibbler allele was 
generated using a guide RNA CACGGGAAACCCGTGAGAAA. The resulting 
allele has an insertion of IDYKDHDGDYKDHDIDYKDDD after the start codon. 
w'118 strain was used as a wild-type control throughout the study except the 
analysis of TE expression from ovaries using RNA sequencing where an shRNA 
line against white crossed to MTD-GAL4 was used as a control”’. Flies were aged 
for 6 days and kept on apple juice agar plates supplemented with yeast paste to 
ensure consistent ovarian morphology. 
Construction of shRNA expression vectors. To achieve simultaneous knockdown 
of two genes with a single shRNA construct, we modified the Valium20 vector?! 
and inserted two tandem shRNA sequences. The two hairpins are separated by 
the sequence that spans the miR-6-3 and miR-6-2 hairpins in the genome in order 
to maximize efficient processing (referred to as miR-6 backbone in Extended 
Data Fig. 2a). Nhel/EcoRI and Agel/SpHI sites were used to clone the two shRNAs 
oligos. The modified shRNA expression cassette (restriction sites used for cloning 
of shRNA oligos are underlined and the miR-6 spacer sequence is in bold): 

TTCAGCCGCTAGCATGGACGTTCATGGATTCTAAAACGGATTACCAG 
GGATTTCAGTCGATGTGAATTCAGGCGAGCACGGCCAATTCCAACGA 
TTTGTCATTTGTGGCACGCATTTGTGTCACCTCAGTGCGAAAATTGAA 
AATTGTATTCAGCCACCGGTTGCATAAGGCTATGAAGAGATACGCCCT 
GCATGCAGGCGAG 

The shRNA oligo sequences are: 

Zucchini shRNA forward (guide miR sequence is underlined): 

CTAGCAGTCACGAACTTGATGCACAACAATAGTTATATTCAAGCATAT 
TGTTGTGCATCAAGTTCGTGGCG 

Zucchini shRNA reverse: 

AATTCGCCACGAACTTGATGCACAACAATATGCTTGAATATAACTATT 
GTTGTGCATCAAGTTCGTGACTG 

Nibbler shRNA forward (guide miR sequence is underlined): 

CCGGTAGTATGGTCAGTGATCTCAGTGTATAGT TATATTCAAGCATATA 
CACTGAGATCACTGACCATGCGCATG 

Nibbler shRNA reverse: 

CGCATGGTCAGTGATCTCAGTGTATATGCTTGAATATAACTATACACTG 
AGATCACTGACCATACTA 

Papi shRNA forward (guide miR sequence is underlined): 

CCGGTAGTTCGACATATCCTAGATCCTAATAGTTATATTCAAGCATATT 
AGGATCTAGGATATGTCGAGCGCATG 

Papi shRNA reverse: 

CGCTCGACATATCCTAGATCCTAATATGCTTGAATATAACTATTAGGAT 
CTAGGATATGTCGAACTA 
GFP-tagged Pacman rescue constructs. Recombineering of Pacman* rescue 
constructs was as described**. Papi (Pacman clone CH322-41G09) and Zucchini 
(CH322-41M17) were tagged with GFP-Flag tags C-terminally, and Nibbler 
(CH322-18104) was tagged N-terminally. 
Generation of piRNA reporter constructs. All piRNA reporter constructs were 
generated as described previously” using the following sequences (target sites for 
endogenous piRNAs are marked in bold): 

Reporter with two target sites (Fig. 1d): 

GGACGAGCTGTACAAGTAATGATCGAGTCAGATAAGCCGTATCAATC 
AAAGGTTATCAA AGAGCAACTGAAGTACAAGAAAATCGCCTCGGTTC 
AATGAAATAACTCCAAGAATGCTCATTGAAATGATACTGTCAAAATGTT 
TCACAGGAAACTACTCAAATATAACCAAAAATTGAAGCAGTCATCAAGT 
GAACGAAAGAAATGAAAACATGACACGTGATCAACTCGAGCGCATAGA 
AATTAATTTATAAATTGCAAAACCTAATTACGTAGCTAAACAAAAAACC 
CAAAAGAATAATGTAAGCAAAAAGAAAATTTAATCGTCATAATCGTCAC 
CAAACCTTTTGATTGGCGAGCATTTCATAGATGTTAAATTTTCCTAATT 
CGAGAATCCCAGGAGGATAAGCGATAGGGATGATCAGAA 

Reporter with two target sites (Extended Data Fig. 2f): 

GGACGAGCTGTACAAGTAATGATCGAGTCAGATAAGCCGTATCAAAT 
TTCAAAGGT TATCAAAGAGCAACTGAAAACCACCGTATAGTGACTATA 
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CAGCAACTCCCACTTAAAAAAGGAAATATTCATGAACCTGCTTTAGCC 
TAGGAAACTACTCAAATATAACCAAAAATTGAAGCAGTCATCAAGTGAA 
CGAAAGAAATGAAAACATGACACGTGATCAACTCGAGCGCATAGAAAT 
TAATTTATAAATTGCAAAACCTAATTACGTAGCTAAACAAAAAACCCAAA 
AGAATAATGTAAGCAAAAAGAAAATTTAATCGTCATAATCGTCACCAAA 
CCTTTTGATTGGCGAGCATTTCATAGATGTTAAATTTTCCTAATTCGAGA 
ATCCCAGGAGGATA AGCGATAGGGATGATCAGAA 

Reporter with three target sites (Fig. 4b): 

GGACGAGCTGTACAAGTAATGATCGAGTCAGATAAGCCGTATCAAAT 

TTCAAAGGTTATCAAAGAGCAACTGAAAACCACCGTATAGTGACTAAA 
TACCTACTCCCACTTAAAAAAGGAAATATTCATGAACCTGCTTTAGCC 
TAGGAAACTACTCAAATATAACCAAAAATTGAAGCAGTCATCAAGTGAA 
CGAAAGAAATGAAAACATGACACGTGATCAACTCGAGCGCATAGAAAT 
TAATTTATAAATTGCAAAACCTAATTACGTAGCTAAACAAAAAACCCAAA 
AGAATAATGTAAGCAAAAAGAAAATTTAATCGTCATAATCGTCACCAAA 
CCTTTTGATTGGCGAGCATTTCATAGATGTTAAATTTTCCTAATTCGAG 
AATCCCAGGAGGATAAGCGATAGGGATGATCAGAA 
Generation of Nibbler antibody. Purified His-tagged full-length Nibbler protein 
was used to generate the mouse anti-Nibbler monoclonal antibody. 
Antibodies used. The following primary antibodies were used. Mouse monoclonal 
anti-Piwi (8C-E4)” (western: 1/1,000; used for IP), rabbit anti-Piwi® (IF: 1/500), 
mouse monoclonal anti-Ago3 (7B4-C2)?” (western: 1/1,000; used for IP), mouse 
monoclonal anti-Ago3 (5H12-G12)’ (IF: 1/30), mouse monoclonal anti-Nibbler 
(IF: 1/400; western: 1/2,000), mouse monoclonal anti-Aub (8A3-D7)’ (IF: 1:500; 
western: 1/1,000; used for IP), mouse anti-ATP-synthase 5A (abcam 14748; IF: 
1/2000; western: 1/20,000). 

Secondary antibodies used were: goat-anti-mouse HRP (Invitrogen), Alexa 
Fluor 488 anti-mouse (Thermo Fisher), Alexa Fluor 568 anti-mouse (Thermo 
Fisher), Alexa Fluor 647 anti-mouse (Thermo Fisher), Alexa Fluor 568 anti-rabbit 
(Thermo Fisher). 

Immunofluorescence staining. Ovaries were dissected into 300 il PBS containing 
4% PFA and fixed for 25 min at room temperature (RT). Tissue was permeabilized 
3 times with PBX (1x PBS, 0.3% TritonX-100) and blocked with BBX (1x PBS, 
0.3% TritonX-100, 0.1% BSA) for 30 min. 2001 of primary antibodies diluted in 
BBX were added and ovaries were incubated while rotating over night at 4°C. 
After washing 3 times with PBX, the ovaries were incubated with the respective 
secondary antibodies diluted in BBX rotating overnight at 4°C. Subsequently, 
ovaries were washed three times with PBX before mounting. To the second 
washing step DAPI was added. Images were taken on a Zeiss LSM-780 Axio Imager 
confocal-microscope using a 40 x/1.3 EC plan-neofluar Oil DIC objective. All 
images were processed using Image]. 

Immunoprecipitation of PIWI-family proteins for piRNA sequencing. For each 
genotype, 600,11 of ovaries were dissected into 1x PBS on ice. 1.5ml of 1x RIPA 
buffer were added (50 mM Tris-HCl pH 7.5, 150mM NaCl, 1% TritonX-100, 0.1% 
SDS, 0.1% Na-deoxycholate, 1 mM EDTA, 0.1mM Pefabloc (Roche)) and the tissue 
was homogenized, using a glass tissue homogenizer, on ice. The lysate was cleared 
by centrifugation and diluted with 3 ml of IP dilution buffer (50 mM Tris-HCl 
pH 7.5, 150mM NaCl). Antibodies were coupled to M280 sheep anti-mouse IgG 
Dynabeads (Life Technologies). For Piwi and Aub IP, 150,11 of beads coupled to the 
respective antibody were mixed with 1.5 ml of lysate. For Ago3 IP, 30011 of beads 
coupled to Ago3 antibody were mixed with 3 ml of lysate. Antibodies used are 
indicated above. Lysates were incubated rotating at 4°C overnight. Subsequently, 
the beads were captured and washed seven times with IP wash buffer (50 mM 
Tris-HCl pH 7.5, 500 mM NaCl, 2mM MgCh, 10% glycerol, 1% Empigen). For 
Piwi IP, 150 mM NaCl was used instead of 500 mM NaCl. The bound RNA was 
extracted using acid-phenol:chloroform (Ambion) followed by ethanol precipi- 
tation. To visualize the extracted RNA during the cloning process 10% of it was 
labelled with [y-*?P]-ATP. 

Co-immunoprecipitation of Flag-Nibbler. 10011 each of freshly dissected ovaries 
from 1-2-day-old w!”* and flag-nibbler homozygous females were homogenized 
in lysis buffer (30 mM HEPES-KOH pH 7.4, 500 mM NaCl, 2mM MgCl2, 0.5% v/v 
Triton-X, 0.2mM DTT, and 1x cOmplete, EDTA-free protease inhibitor cocktail 
(Roche)) using a glass tissue homogenizer, on ice. The lysates were cleared by 
centrifugation and incubated with 30,1] of anti-Flag M2 agarose beads (Sigma) for 
4h at 4°C. Beads were captured and washed 5 times with the lysis buffer. IP fraction 
was eluted by incubating the beads with 30 11 of lysis buffer containing 0.5 1g jul! 
3 x Flag peptides for 15 min at RT with gentle shaking. 

2S rRNA depletion from total RNA. For the depletion of 2S rRNA from 101g 
of total ovarian RNA, 100,11 slurry of Myone Streptavidin C1 beads (Invitrogen) 
were used. The beads were washed twice with 500 1l 0.5 SSC. After washing, the 
beads were re-suspended in 300,11 0.5 x SSC and 1 1l of 100 1M 2s-rRNA-as-oligo 
(Bio-AGTCT TACAACCCTCAACCATATGTAGTCCAAGCAGCACT) were 
added and mixed. This mixture was incubated on ice for 30 min, followed by 
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a wash with 50011 of 0.5 x SSC to remove unbound 2s-rRNA-as-oligo. After 
resuspension of the beads in 10011, they were incubated at 65°C for 5 min. The 
RNA was denatured for 5 min at 80°C for 5min and mixed with the beads, 
followed by incubation at 50°C for 1h. Fraction not bound to the beads was 
collected and the RNA was ethanol precipitated and served as input for the small 
RNA cloning procedure. 

Small RNA cloning and sequencing. Small RNA libraries were prepared as 
described previously™. In brief, total RNA from respective ovaries was isolated 
using TRIzol, and was subjected to 2S-depletion*>. Oxidation was done by 
incubating 2S-depleted total RNA in borate buffer (pH 8.6) containing 25 mM 
NaIO, at room temperature for 30 min. RNA samples from total RNA and from 
IP experiments were resolved on a denaturing polyacrylamide gel and small RNAs 
ranging from 18-40 nt were excised and recovered. These were subsequently 
ligated to 3’ and 5’ adapters containing four random nucleotides at both ends to 
reduce ligation biases. Ligated RNA was reverse-transcribed, PCR amplified and 
the libraries were sequenced on an Illumina HiSeq 2500 machine in single-read 
50 mode. IP libraries of the Zucchini/Nibbler-depleted flies were sequenced in 
single-read 100 mode. 

RNA sequencing. PolyA+ RNA-seq was performed as described in Mohn et a 
using NEBNext Ultra directional RNA library prep kit for Illumina (NEB) and 
libraries were sequenced on Illumina HiSeq 2500 in paired-end 125 mode”. Six 
bases at each end of the reads were trimmed and the remaining part was split into 
three reads (37, 38, 38 nt). Reads were mapped against TE consensus sequences 
using Bowtie (0. 12. 9) allowing up to three mismatches”’. RPKM for each TE 
was calculated as reads per million genome mapped reads per kb. TEs that were 
expressed more than 1 RPKM in at least one of the libraries in the comparison 
(group1: w!!8 and nibbler~-, group2: control depletion, zucchini depletion and 
zucchini and nibbler depletion) were included for the analysis. 

Northern blot. Small RNAs were enriched from 501g of total RNA using size 
selection on a 12% denaturing urea-polyacrylamide gel. Oxidation was performed 
as mentioned above. elimination was done after the oxidation by adding 50mM 
fic. NaOH and incubating for 90 min at 45°C. The protocol for northern blotting 
was modified from Pall et al.>”. 

In brief, a 15% denaturing urea-polyacrylamide gel (0.4mm thick) was used 
to run the size-selected RNA samples. The gel was blotted to a Hybond-NX 
membrane (Amersham) in a semi-dry blotting apparatus. This was followed 
by chemical cross-linking using 0.373 g 1-ethyl-3-(3-dimethylaminopropyl) 
carbodiimide (EDC) (Sigma) powder in 12 ml 1x methylimidazole (Sigma) 
at 60°C for 1h. Afterwards the membrane was pre-hybridized in 10-15 ml 
(depending on membrane size) church buffer (1 mM EDTA, 0.5 M NagHPO,/ 
NaHPO, pH 7.2, 7% SDS) at 65°C for at least 20 min. DNA oligonucleotide probes 
were labelled using [~\-*’P]-ATP (PerkinElmer). The radioactively-labelled probe 
was added to the church buffer and incubated overnight at 32°C. The membrane 
was subsequently washed 3 times with 15 ml of 1x SSC + 0.1% SDS. Finally, a 
phosphor-storage screen was exposed to the membrane and developed using a 
phosphor imager. All images were processed using Image]. 

The following DNA oligos were used as probes: 

miR-34 (Extended Data Fig. 3d): 

AACCAGCTAACCACACTGCCA 

miR-8 (Figs 1b, 2a, b and 4f): 

GACATCTTTACCTGACAGTATTA 

piRNA enriched in Ago3 (Fig. 2a): 

GCGATTTTCTTGGGTTCAGTTGCT 

piRNA enriched in Aubergine (Fig. 2b): 

TCTTGGAGCTAACTTCTTTCGTA 

21-nt-long piRNA (Fig. 4f): 

TCTAGATTGGCTGCTATTAAA 

26-nt-long piRNA (Fig. 4f): 

AAGCTACTGAAGTCATACCTATA 

32-nt-long piRNA (Fig. 4f): 

AGAACAAACTGGCCAAGGTATCAA 

piRNA precursor (Fig. 1b): 

AGTCTGGAGTTCAAAGCTCTTCTA 
Processing of small RNA libraries. Processing and mapping of sequenced small 
RNA reads was carried out as described in Mohn et al.’. In brief, raw reads were 
trimmed off the 3’ linker sequence (AGATCGGAAGAGCACACGTCT) and the 
four random nucleotides at each end were removed. Processed reads were mapped 
to the Drosophila genome (dm3, release 5.55) using Bowtie (0. 12. 9) allowing zero 
mismatches. Reads that mapped to genomic regions annotated as TE were used for 
the subsequent analyses (filtered reads). Libraries from total RNA were normalized 
to 1 million miRNA reads. Normalization of PIWI-IP libraries was as described 
previously’. R (version 2.15.3) was used for statistical and graphical analyses unless 
otherwise indicated. 
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TEs that were included in the analyses. 63 TEs that fulfilled the following 
criteria were defined as germline-enriched TEs. 1) TEs that produced more 
than 1,000 p.p.m. of piRNAs (sum of sense and antisense piRNAs) in wild-type 
ovaries; 2) germline-derived Piwi-bound piRNAs constitute more than 85% of 
total Piwi-bound piRNAs””. 63 TEs were grouped into two groups based on the 
ratio of summed TE mapping reads of normalized Piwi/Aub/Ago3 IP libraries in 

Zucchini-depleted ovaries relative to the control depletion (group1: >0.2,n= 19, 

group2: <0.2, n=44, used in Extended Data Figs 1b and 6d). 11 out of 63 TEs that 

fulfilled the following criteria were used for the coupling/ping-pong analyses in 

Fig. 4c and Extended Data Fig. 8c. 1) TEs that produced more than 10% of piRNAs 

in Zucchini and Nibbler-depleted ovaries compared to the control depletion; 

2) TEs with a canonical ping-pong Z-score (see below) greater than 15. 

Statistical analysis of size profiles. 1,000 TE-mapping reads were randomly 

sampled 100 times. The size distribution of these 1,000 reads was compared 

between genotypes using a t-test. The mean P value of 100 comparisons was 
calculated to determine the significance. 

Linkage analysis. Linkage calculation was adapted from Webster et al.”°. First, 

filtered small RNA reads were mapped to TE consensus sequences”’ allowing 

three mismatches. Second, 5’ and 3’ ends of mapped reads were counted at each 
nucleotide position on both strands. For each linkage analysis, the statistical 
significance of observing a certain linkage was calculated as follows. 

(1) For example, for the calculation of canonical ping-pong, piRNA 5’ ends were 
randomly subsampled for a fixed number of times from antisense and sense 
mapped piRNAs. We sampled for the number of a fifth of TE length (bp). 
For instance, if the size of a TE is 5,000 bp, we sampled 5’ ends 1,000 times. A 
fixed number of samplings allows for a fair comparison between libraries of 
different sequencing depths. The shuf function in bash 4.2.37(1) was used to 
obtain random numbers. 

(2) We counted the number of 5’ ends of antisense piRNAs that find at least one 
5’ end of sense piRNA at 10-nt off-set (N). 

(3) We computationally repeated (1) and (2) for 500 times and calculated the 
mean of Nmean- 

(4) Suppose two sets of random 5’ ends, the probability of an antisense 5’ end to 
find a ping-pong partner p is 1 — (1 — 1/L)" where L is the length of TE. The 
expected number of N from a random set (Nexpectea) is L/5 x p. 

(5) Z-score was calculated as (Nmean — Nexpected)/o where the standard deviation 
ois (L/5 x px (1—p))°. 

We used the following sets of sequencing reads for calculation. Canonical ping- 

pong: 5’ ends of antisense and sense mapped reads from total libraries (10-nt off- 
set, Extended Data Figs 1b and 8c). 3/5’ coupling: 3’ ends and 5’ ends of antisense 
mapped reads from total libraries (1-nt off-set, Extended Data Figs 1b and Fig. 4c). 
3//5' ping-pong: 3’ ends of antisense mapped reads and 5’ ends of sense mapped 
reads (10-nt off-set, Extended Data Figs 1b and Fig. 4c). 
Mapping small RNA reads to reporter constructs. Responder and trigger 
piRNA reads were determined by mapping the total small RNA reads including 
genome-unmapped reads to reporter construct sequences using bowtie allowing 
zero mismatches. To determine trigger piRNA levels, only the first 20 nt of 
sequencing reads were used in order to account for 3’ heterogeneity. Mapped reads 
were normalized to 1 million miRNA reads. 

Measure the definition of 3’ ends. The analysis was adapted from Mohn et al.. In 

brief, piRNA 5’ end positions where the downstream 20-35 nt window had equal 

number of mappings in the reference genome were selected as unambiguous 5’ end 
positions. We only included the reads whose 5’ ends are defined as unambiguous 

(analysable reads). These analysable reads were collapsed on 5’ ends yielding the 

counts of each length per given 5’ end. 5’ ends that have more than 20 raw counts 

as well as more than 2 counts normalized to one million TE-mapping reads (p.p.m.) 

were included in the analysis. These cut-offs allowed the comparison of different 

sequencing depths as well as the confident assessment of 3’-end variants. Finally, the 
fraction of the counts representing the dominant length per 5’ end was calculated as 

a percentage (definition). For the analysis of somatic piRNAs, piRNAs that mapped 

to soma-enriched 1 kb tiles were used with a cut-off of soma-index greater than 8°°. 

Nucleotide analysis. Only piRNA reads that passed the requirements for 3/-end 

analysis (see above) were used for the analysis. The entire analysis is sequence- 

based, not read-based (cloning frequency is therefore not considered). Nucleotide 
windows surrounding the respective positions were extracted using get-fasta 
from the fastX-toolkit (Hannon laboratory). For each 5’ end, the dominant 

3’ end position was used to extract the surrounding sequences. When indicated, the 

5’ ends were binned into ten groups containing an equal number of 5’ ends 

according to the definition of their 3’ ends. The nucleotide signatures were 

generated using weblogo 3.4 and Prism 6 was used for visualization. 

Contribution analysis for Nibbler and Zucchini. The relative contribution of 

Nibbler and Zucchini in forming the 3’ ends of Aub- or Ago3-bound piRNAs was 

modelled as follows: 
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Analysable piRNA 5’ ends (see above) with at least 20 p.p.m. in the respective 
IP libraries of all three genotypes (w!”%, nibbler~/~, and Zucchini depletion) were 
included in the analysis. First, the 3’-end composition of individual 5’ ends in 
a window of 20-30 nt of length was determined. The 3’-end profile of piRNAs 
from nibbler-/~ and Zucchini-depletion libraries were summed in different ratios 
(0-100% and the reciprocal values) to generate a combined profile. Second, the 
combined profile was compared to the w!”"° profile using a linear regression model. 
The R? value was used to assess the best fit. Finally, the ratios of Nibbler versus 
Zucchini contribution were extracted at the maximal R? value. 

Heat maps for 3/-to-5’ coupling. Heat maps were generated as previously 
described’. In brief, filtered reads (see above) from total small RNA libraries were 
mapped to TE consensus sequences allowing up to three mismatches. We selected 
pairs of complementary reads with 10-nt 5’ overlap (ping-pong piRNA pairs). 
Ping-pong piRNAs that were cloned greater than 10 p.p.m. and more than 2% of 
the most abundant piRNA per individual TEs were selected for the analysis. We 
extracted the counts of 5’ and 3/ ends mapped in the window of +100 nt for each 
ping-pong piRNA (two instances per pair) using annotatePeaks (Homer v.4.5) 
and transformed the counts at each position to a percentage value (read count 
table). The read count tables per ping-pong piRNA were sorted by the length of 
the dominant ping-pong responder piRNA and by the relative abundance of the 
ping-pong responder piRNA in the window, and were assembled into heat maps. 
The distribution of 5/-to-5’ distance of flanking ping-pong piRNAs was calculated 
by summing the read count table of all ping-pong piRNAs in the respective 
genotypes (Extended Data Fig. 8d). 

Orthologue search. Orthologues for Zucchini/MitoPLD, PARN, PNLDC1, and 
Nibbler/Mut7 were first searched using orthoDB v.9 against metazoan species 
(http://www.orthodb.org/)**. Orthologue groups (EOG091G0FI5: Zucchini/ 
MitoPLD, EOG091G04CM: PARN/PNLDC1, EOG091G04KJ: Nibbler/Mut7) 
were analysed. Protein entries that were not annotated with the following pfam 
domains were excluded: Zucchini/MitoPLD: PLDc_2 (PF13091), PARN/PNLDC1: 
CAF1 (PF04857), Nibbler/Mut7: DNA_pol_A_exol (PF01612). Pfamscan (http:// 
www.ebi.ac.uk/Tools/pfa/pfamscan/) was used to search for the domains (pfam-A, 
E-value: 0.1). We defined PARN as proteins that have RNA_bind (PF08675) and/or 
RRM_7 (PF16367) and PNLDC1 as proteins that do not have these annotations. 
Species that did not have orthologues in orthoDB were further searched for 
orthologues first by BLASTp (NCBI) against non-redundant protein sequences 
and second by tB3LASTn (NCBI) against whole genome shotgun sequences using 
the default cut-offs. Hits from BLASTp and tBLASTn were reciprocally surveyed 
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for predicting the query proteins as the most confident homologue. All hits were 
combined and displayed in a phylogenetic tree (either presence or absence) using 
iTOL (http://itol.embl.de/)®. 

Code availability. The code that underlies the computational analyses in this 
manuscript is available at https://gitlab.com/groups/Genetic_and_mechanistic_ 
diversity_of_piRNA_3prime_end_formation. 

Data availability statement. All sequence data that support the findings of 
this study (see SI) have been deposited in NCBI GEO with the accession code 
GSE83698. All fly lines used in this study are available from VDRC (http:// 
stockcenter.vdrc.at/control/main). Source data for all gel images are provided with 
the paper (see Supplementary Information). 
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Extended Data Figure 2 | Nibbler, but not Papi, is required for the 
generation of piRNAs in the absence of Zucchini. a, Design of the short 
hairpin expression cassette that allows the simultaneous RNAi-mediated 
knockdown of two genes in a tissue specific manner. shRNAs are separated 
by the miR-6 backbone and can be cloned via indicated restriction sites. 

b, c, Confocal sections of egg chambers (scale bars, 101m) of indicated 
genotype expressing GFP-tagged Zucchini (b, c) and GFP-tagged Papi (b), 
showing the efficient shaRNA-mediated knockdown of Zucchini and 

Papi (b), or Zucchini and Nibbler (c) in the germ line. Nibbler was 
detected using a monoclonal antibody. d, Confocal images showing the 
localization of GFP-tagged Papi and Nibbler in Drosophila egg-chambers 
(scale bars, 101m). Functionality of GFP-Nibbler is demonstrated 

in Extended Data Fig. 3d. e, Confocal sections through single nurse 

cell nuclei of egg chambers expressing GFP-tagged Zucchini (left) or 

Papi (right) and stained for mitochondria (immuno-staining of ATP 
synthase). f, Shown are mappings of piRNAs (5 ends only; red, sense; 
black, antisense) from ovaries of indicated genotype to a second reporter 


construct (as in Fig. 1d). Values are normalized to 1 million sequenced 
miRNA reads. g, Scatter plot displaying miRNA-normalized piRNA 
levels mapping to 63 germline-dominant TEs in Zucchini-depleted 
versus Zucchini/Papi-depleted ovaries. h, Shown are confocal sections 

of egg chambers expressing indicated piRNA biogenesis reporters (GFP- 
fluorescence; DNA stained with DAPI) in wild-type (top) or Zucchini 
and Nibbler-depleted ovaries (middle and bottom). Top, reporter with 

no target site; middle, reporter used in Fig. 1d; bottom, reporter used 

in Extended Data Fig. 2f. i, Ovary lysates expressing N-terminally Flag- 
tagged wild-type Nibbler was immunoprecipitated (IPed) using M2 
magnetic beads. Wild-type ovary lysates were used as a control. Red colour 
in the blot indicates a saturated signal. ATP synthase 5A (ATP syn) serves 
as loading control. Ovary volume indicates the amount of loaded lysate/ 
IP fraction. j, Eluates from IP were blotted with indicated antibodies. 
Piwi is slightly enriched in the Flag—Nibbler IP fraction, while there is no 
detectable enrichment of Aub or Ago3 in the IP fraction. 
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Extended Data Figure 3 | Molecular characterization of the piRNA 
pathway in nibbler and papi mutant flies. a, papi gene locus indicating 
the position of the Cas9-induced frameshift allele. b, nibbler gene locus 
indicating the position of the Cas9-induced frameshift allele. c, Western 


blot analysis showing the loss of detectable Nibbler protein in nibbler 


af 


ovaries. ATP-synthase 5A antibody is used as loading control. Loaded 
amounts of ovary lysates are indicated. d, Northern blot analysis of the 
Nibbler-substrate miR-34 comparing small RNAs obtained from ovaries 


of w 


1118 


or nibbler 


mf 


flies. The GFP-Nibbler rescue transgene (used in 


Fig. 1f) restores miR-34 processing. e, f, Immunostainings (e) and western 


blot analysis (f) of Piwi, Aub and Ago3 in w 


1118 


or in nibbler 


ais 


ovaries, 


soma enriched Piwi-bound piRNAs 


showing that localization and expression of the three PIWI-clade proteins 
are unperturbed (arrow heads; scale bars, 10j1m). ATP synthase 5A (ATP 
syn) served as loading control. g, Scatter plot displaying the steady-state 
RNA level of TEs in indicated genetic background (only TEs with RPKM 
>1 in either background; n = 40). h, Bar chart displaying TE mapping 


piRNA levels in w 


1118 


or in nibbler~’~ ovaries (values normalized to 


1 million sequenced miRNA reads). i, j, Length profiles of TE mapping 
small RNA reads obtained from ovaries of indicated genotypes. Shown 
are all ovarian small RNAs (i) or Piwi-bound piRNAs defined as soma- 
enriched (j; see Methods). Displayed are fractions of reads of indicated 
length as a percentage (mean lengths are indicated below). 
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Extended Data Figure 4 | A small RNA library cloning approach that 
allows the recovery of longer piRNA species. Drosophila total RNA 
contains large amounts of the 30-nt long 2S rRNA. Previous cloning 
approaches therefore typically restrict small RNA cloning to the 18-29 nt 
window by cutting these small RNA populations from a gel. We used a 
previously published 2S rRNA depletion method”, followed by extracting 


small RNAs ranging from 18-40 nt in length for library preparation. 
Shown are size distributions of TE mapping small RNAs (obtained from 
w'!!8 ovaries) comparing the standard small RNA cloning protocol (left) 
and the protocol using total RNA depleted of 2S rRNA (middle; see 
Methods). An overlay of the longer reads (>27 nt) is displayed to the right. 
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Extended Data Figure 5 | Zucchini and Nibbler generate Aub- and 
Ago3-bound piRNA 3’ ends independently of piRNA 5/-end formation. 
a, Box plots (***P < 0.001 by two-sided t-test) showing the 3’-end 
definition (see Methods) of Ago3-, Aub-, and Piwi-bound piRNAs isolated 
from ovaries of the indicated genotypes. Soma-enriched Piwi-bound 
piRNAs (see Methods) are shown in boxes with diagonal lines. b, Stacked 
bar plots displaying the nucleotide composition at the 5’ end or position 
10 of piRNAs bound to Aub-, Ago3-, or Piwi (isolated from ovaries of 
indicated genotypes). The plots show the composition within ten equally 


sized bins, sorted for their 3’ end precision index (see Methods). c, Box 
plots (***P < 0.001 by Wilcoxon rank-sum test) showing the R? values 

of the comparison between the 3’-end profiles of Ago3- or Aub-bound 
piRNAs from w!"/ ovaries and those of nibbler~/~ ovaries (Zucchini 
only), those of ovaries depleted of Zucchini (Nibbler only), and those of 
the calculated composite that provides the highest R? (best fit). Midlines 
in a and c indicate the median value, box ranges from the first to the third 
quartile, whiskers are 1.5x the interquartile range 
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Extended Data Figure 6 | Ago3 incorporates more TE antisense 
piRNAs in nibbler mutant flies. a, b, Scatter plots displaying the strand 
bias (antisense divided by sense) of piRNA populations in w!1/8 versus 
nibbler~/~ ovaries. a, piRNAs bound to Piwi, Aub, or Ago3. b, piRNAs 
from total ovarian RNA. c, Z-scores of 5'/5’ ping-pong levels per TE 


(63 germline dominant TEs) in w!!!8 versus nibbler~/~ ovaries. 
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d, Grouping of TEs (63 germline dominant TEs) based on fold change in 
piRNA levels between Zucchini-depleted and control ovaries (left). Box 
plot indicates Aub/Ago3-bound piRNA levels in wild-type ovaries for 
defined TE groups (midline indicates the median value, box the first and 
third quartiles, whiskers are 1.5x the interquartile range; ***P < 0.001 
after Wilcoxon rank-sum test). 
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contro! depletion (RPKM) 


Nibbler double-depleted ovaries. a, Heat map displaying fold changes 

in antisense piRNA levels per TE (n = 63) in indicated genotypes versus 
control (six biological replicates). Dots mark TEs, which lose piRNAs only 
in Zucchini/Nibbler-depleted ovaries. b, Mappings (5’ ends only; red, 
sense; black, antisense) of piRNAs onto the GATE and Doc TE consensus 
sequences. The plots at the top are from a piRNA library obtained from 
Zucchini-depleted ovaries, and the ones at the bottom are from a piRNA 
library obtained from Zucchini/Nibbler double-depleted ovaries. The 


Zucchini depletion (RPKM) 
Extended Data Figure 7 | piRNAs are abundantly produced in Zucchini/ 


~100-bp window of Doc that is detailed in Fig. 4a is depicted by brackets. 
c, Size distributions of TE-mapping small RNAs obtained from total small 
RNA libraries from Zucchini-depleted (top) or Zucchini/Nibbler-depleted 
(bottom) ovaries. Shown are the average values from six biological 
replicates (reads were normalized to 1 million sequenced miRNA reads). 
d, Scatter plots displaying the steady-state RNA level of TEs in indicated 
genetic background (only TEs with RPKM >1 in either background are 
shown; n= 74). 
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Extended Data Figure 8 | Precise coupling of neighbouring ping-pong 
piRNAs in Zucchini/Nibbler-depleted ovaries. a, Immunofluorescence 
images (confocal sections) of egg chambers of indicated genotype stained 
for endogenous Piwi protein (scale bars, 101m). Note that the shRNA- 
mediated knockdown is specific for germline cells. Somatic follicle 

cells that also express Piwi therefore serve as control. b, Length profiles 
(fractions of reads per indicated length as a percentage) of TE-mapping 
piRNAs bound to Aub/Ago3 in indicated genotypes. c, Box plot (midline 
indicates the median value, box the first and third quartiles, whiskers are 
1.5x the interquartile range) displaying the distribution of Z-scores for 
canonical 5'/5’ ping-pong of piRNA populations isolated from ovaries of 
indicated genotypes. The analysis is restricted to the 11 germline enriched 
TEs that maintain piRNA production in Zucchini/Nibbler-depleted 


ovaries (see Methods). Midline indicates the median value, box ranges 
from the first to the third quartile, whiskers are 1.5 the interquartile 
range. d, Histogram showing the frequencies of a cloned ping-pong 
piRNA 5’ end downstream of a responder piRNA 5’ end at a certain 
distance on the same strand. e, Sequence logos displaying the nucleotide 
composition within and downstream of Aub- and Ago3-bound piRNAs 
cloned from ovaries of indicated genotypes. 5’-end position and position 
10 are measured from the piRNA 5’ end. 3’-end position is the dominant 
3‘ end of a certain piRNA 5’ species. Downstream positions are anchored 
by the dominant 3’-end position. Nucleotide signatures with respect to the 
position of sense and antisense slicer piRNAs are depicted in the cartoons 
below. 
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Extended Data Figure 9 | piRNAs generated by slicer/slicer are slicer sites. Mappings were normalized to 1 million sequenced miRNA 


methylated. a, Shown are the mappings of those piRNAs (5/ ends in black; _ reads. b, An oxidized library control of the heat map shown in Fig. 4d, 
3’ ends in grey) that were probed by northern blots in Fig. 4f. The 5’ ends showing the precise coupling of piRNA 5’ ends (blue) and 3’ ends (red) in 
of antisense piRNAs (red) map precisely 10 nt downstream of the predicted | Zucchini/Nibbler-depleted ovaries. 
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Extended Data Figure 10 | Conservation of Zucchini/MitoPLD, PARN, PNLDC1, and Nibbler/Mut-7 across metazoa. 328 metazoan species are 
displayed in a phylogenetic tree (generated using iTOL). The presence of indicated orthologues in each species is marked (black, Zucchini/MitoPLD 
green, PARN; blue, PNLDC1; red, Nibbler/Mut-7). Taxonomic groups mentioned in the text are highlighted (for details see Methods). 
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An oxidative N-demethylase reveals PAS transition 
from ubiquitous sensor to enzyme 


Mary Ortmayer', Pierre Lafite!+, Binuraj R. K. Menon!, Tewes Tralau!+, Karl Fisher!, Lukas Denkhaus!, Nigel S. Scrutton!, 


Stephen E. J. Rigby!, Andrew W. Munro!, Sam Hay! & David Leys! 


The universal Per-ARNT-Sim (PAS) domain functions as a signal 
transduction module involved in sensing diverse stimuli such as 
small molecules, light, redox state and gases!”. The highly evolvable 
PAS scaffold can bind a broad range of ligands, including haem, 
flavins and metal ions. However, although these ligands can support 
catalytic activity, to our knowledge no enzymatic PAS domain has 
been found. Here we report characterization of the first PAS enzyme: 
a haem-dependent oxidative N-demethylase. Unrelated to other 
amine oxidases, this enzyme contains haem, flavin mononucleotide, 
2Fe-2S and tetrahydrofolic acid cofactors, and specifically catalyses 
the NADPH-dependent oxidation of dimethylamine. The structure 
of the « subunit reveals that it is a haem-binding PAS domain, 
similar in structure to PAS gas sensors*. The dimethylamine 
substrate forms part of a highly polarized oxygen-binding site, and 
directly assists oxygen activation by acting as both an electron and 
proton donor. Our data reveal that the ubiquitous PAS domain can 
make the transition from sensor to enzyme, suggesting that the PAS 
scaffold can support the development of artificial enzymes. 

The PAS domain has been found in all kingdoms of life, functioning 
as a signal transducer that couples ligand binding to a cellular signalling 
response’”. The highly divergent PAS family couples a remarkable plas- 
ticity in binding ligands to an equally wide variety of sensory stimuli. 
The typical PAS fold consists of a single antiparallel, five stranded 
B-sheet with intervening a-helices that create the ligand-binding 
pocket on the 8-sheet surface. Despite possessing structural similarity, 
PAS domains often have low sequence similarity, with no universally 
conserved residues. This suggests a high adaptability, which has 
been experimentally verified for photoactive yellow protein*. Despite 
PAS domains binding cofactors with inherent chemical reactivity, no 
enzymatic activity has been associated with the PAS family. By contrast, 
other folds have made the transition to enzyme, such as globins>®. 
Given the remarkable plasticity and universal occurrence of PAS, this 
might suggest an inherent limitation to developing enzymatic activity. 
Understanding of such limit(s) is key to the emerging field of artificial 
enzyme development, which often makes use of ligand-binding scaffolds’. 

We show that an orphan enzyme previously described as a secondary 
amine mono-oxygenase®”, but here referred to as haem-dependent 
oxidative N-demethylase (HODM), contains a haem-dependent PAS- 
scaffold that serves as the site of the oxidative N-demethylase activity. 
HODM functions in bacterial methylated amine catabolism, linking 
alkylamine oxidation to the tetrahydrofolate C1 pool!®. Methylated 
amines are formed through various processes" and the trimethylamine 
degradation pathway has been outlined since the 1970s (Extended Data 
Fig. 1a). Previous spectroscopic characterization of HODM revealed 
that it combines features typically associated with haem proteins as 
diverse as globins, cytochromes P450 and cytochrome bs (refs 8, 9). 
The NADPH-dependent dimethylamine (DMA) oxidation reaction 
catalysed by HODM appears similar to a typical cytochrome P450 
N-dealkylation reaction”, although HODM lacks the characteristic 


P450 cysteine ligation. Whether the enzyme catalyses a P450-like 
mono-oxygenation has not been established. Until recently, further 
studies have been hampered by the lack of sequence information”. 

We expressed the operon proposed to encode for HODM from 
the proteobacterium Pseudomonas mendocina in Escherichia coli. 
This includes a gene coding for a DUF3445 family representative of 
unknown function (42 kDa, a), a phthalate dioxygenase reductase-like 
FMN/2Fe-2S dependent oxidoreductase (39 kDa, 3), a glycine cleavage 
T-protein (34kDa, 4) and a smaller protein of unknown function 
(19kDa, 6 subunit) (Fig. la). The purified gene products form a 
heterotetrameric enzyme (Extended Data Fig. 1a) with biochemical and 
spectral properties similar to the Aminobacter aminovorans HODM®? 
(Fig. 1). 

Electron paramagnetic resonance spectroscopy of anaerobically 
isolated HODM reveals signals for all redox centres, including the 
presence of a NO-bound ferrous haem (Fig. 1b-f). HODM catalyses 
the NADPH-dependent oxidative N-demethylation of DMA with a 
Keat of 9.2 £0.5s~! and a KP™4 of 24+6)1M, resulting in the release 
of one equivalent of formaldehyde under in vitro conditions (that is, 
no THF present; Fig. 1g). The presence of the 7 subunit significantly 
lowers the DMA-dependent formaldehyde release in vivo (Extended 
Data Fig. 1b). This suggests substrate channelling occurs between the 
a and + subunits, as observed for other amine oxidases!*)5, Binding 
of oxygen to reduced HODM results in globin-type spectral changes 
(Fig. 1h), and the resulting ferrous—oxygen complex is remarkably 
stable (half-life of approximately 50 min, Extended Data Fig. 1c), 
probably linked to the high haem redox potential (Extended Data 
Fig. 1d). DMA binding perturbs the haem spectrum, and DMA affinity 
is significantly higher for the ferrous—-oxygen complex (Ky= 15 +3 1M) 
(Extended Data Fig. le, f). 

DMA binding to anaerobically purified HODM leads to the loss 
of superhyperfine coupling to the '*N atom of the His ligand in the 
NO-ferrohaem spectrum (Extended Data Fig. 2), probably a conse- 
quence of a change in the relative orientation of the NO and histidine 
ligands to the haem iron!®!”. This suggests DMA binding affects the 
NO ligand, in line with the allosteric effect of oxygen binding on DMA 
affinity. DMA binding also leads to a change in the g values exhibited 
by the [2Fe-2S]'* cluster (Extended Data Fig. 2d), suggesting commu- 
nication between the 8 and a subunits. These effects only occur with 
DMA and not with poor substrates (Extended Data Fig. 2e-h). 

While NADPH-dependent flavin reduction occurs rapidly 
(k= 304+ 25s7! under anaerobic conditions; Fig. 2a), haem reduc- 
tion is extremely slow in the absence of both DMA and oxygen 
(k=0.059 + 0.005 min“ |; Fig. 2b). Haem reduction remains slow in the 
presence of either oxygen (leading to the ferrous—O complex) or DMA 
alone (leading to the ferrous-DMA complex), but rapid mixing of the 
HODM ferrous-oxygen—DMA complex (containing oxidized flavin 
mononucleotide (FMN)/[2Fe-2S]) with NADPH under anaerobic 
conditions reveals that the presence of substrate enhances the initial 


1Manchester Institute of Biotechnology, School of Chemistry, 131 Princess Street, University of Manchester, Manchester M1 7DN, UK. +Present addresses : University of Orléans, CNRS, ICOA, UMR 
7311, F-45067 Orléans, France (P.L.); German Federal Institute for Risk Assessment (BfR), Department of Chemicals and Product Safety, Max-Dohrn Str. 8-10, 10589 Berlin, Germany (T.T.). 
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Figure 1 | Biophysical characterization of P. mendocina HODM. 

a, Schematic overview of the HODM reaction. b-f, X-band continuous 
wave EPR spectra of: anaerobically purified HODM (b); HODM exposed 
to air (c); the difference spectrum formed by the subtraction of c from b 
showing rhombic [2Fe-2S]!* and flavin semiquinone contributions*” (d); 
anaerobically purified HODM spectrum around g= 2.00 run under 
conditions designed to emphasize the neutral flavin semiquinone signal at 


B reduction rate by >1,000 fold, and the ferrous-oxygen-DMA 
complex decay rate by 2.5-fold (Fig. 2c—e). A rate enhancement for the 
8 subunit reduction and intramolecular electron transfer to the ferrous-— 
oxygen-DMA (as opposed to the ferrous—-oxygen) complex is probably 
linked to the observed effects of DMA binding on the [2Fe-2S]'* 
cluster. However, the ferric to ferrous haem iron transition under 
anaerobic conditions remains around 100 times slower (both with and 
without DMA) compared to the kea: of 9.240.551. 

Following the aerobic mixing of HODM with excess DMA and 
NADPH, steady-state conditions are established, during which the 
enzyme remains predominantly in the ferric-DMA-bound state (Fig. 2e). 
This finding suggests that intermolecular electron transfer is rate- 
limiting, supported by the fact that no significant kinetic isotope 
effect (KIE) on the NADPH oxidation rates could be observed using 
perdeuterated DMA or D,O. The mechanism that underpins the rate 
enhancement observed only in the presence of both DMA and O, 
remains unclear at this stage. 

As observed for holo-HODM, reduction of the isolated « subunit 
is a slow process. Mixing of the ferrous-O,-DMA a subunit with 
NADH and phenazine methosulfate under anaerobic conditions leads 
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g=2.00 in d (e); and potassium ferricyanide oxidized HODM (f). 

g, DMA-dependent NADPH oxidation (kca,= 9.2 £0.58 1 and 

Ky>M4 = 24 + 611M). The inset reveals stoichiometric conversion of DMA 
to MA and formaldehyde (errors bars are s.e.m., n =3). h, UV-visible 
absorption features of 8.511.M HODM in the ferric (thick solid line), 
ferrous (thin solid line), Fe?+-Oy (dotted line) and Fe**-CO (dashed line) 
states. 


to accumulation of the ferrous-DMA enzyme via transient formation 
of the ferric-DMA species (Fig. 2f). Low nM (sub-stoichiometric) 
levels of formaldehyde can be detected following this experiment, 
confirming that the a subunit is catalytically active. Although certain 
haem enzymes can make use of peroxide to bypass the initial oxygen 
reduction step, no spectroscopic evidence for higher Fe oxidation 
species could be obtained in the presence of DMA and peroxide, 
nor was formaldehyde formation observed under these conditions 
(Extended Data Fig. 3). 

Unfortunately, we were unable to determine the structure of the 
HODM holoenzyme. However, the isolated « subunit responsible 
for haem, oxygen and DMA binding has spectral and ligand binding 
properties nearly identical to those of the holoenzyme (Extended Data 
Fig. 4), is catalytically competent and amenable to crystallization. The 
1.7 A resolution « subunit structure reveals that it consists of an N- and 
C-terminally extended PAS haem binding domain (Fig. 3), most similar 
in structure to the Aer2 PAS domain, a gas-sensing receptor protein 
controlling chemotaxis in Pseudomonas aeruginosa'* (PDB code 4HI4; 
2.25A rm.s.d. over 85 Ca atoms, Z score 5.6, 12% homology; Fig. 3b). 
The HODM PAS domain (residues 145-231) contains the canonical 
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Figure 2 | NADPH-dependent oxidation of DMA by HODM. a, UV- 
visible spectral evolution (over 1s) from the anaerobic mixing of ferric 
HODM (5.3|1M) with NADPH. Difference spectra (inset) reveal the 

+ subunit flavin is rapidly reduced. No haem reduction is observed. b, As 
in a, but monitored over 3 min. Arrows indicate absorbance changes from 
ferric (409 nm, thick line) to ferrous (429 nm, thick line). The Ayz9 — Aago 
signal (inset) is fitted to an a—b model; k= 0.059 0.005 min“! (errors 
bars are s.e.m., 1 = 3). c, UV-visible spectral evolution (over 20s) from 
the anaerobic mixing of ferrous-oxy HODM (41M) with 200 1M NADPH 
(in the presence of 2mM DMA). An a—+b-+c model was fitted with 

k, =2.58+0.05s ' for ferrous-oxy-DMA (black line) to ferric (grey line) 
and k, = 0.087 + 0.007 s~! for the ferric-DMA to ferrous-DMA (dotted 
line) transition. Similar experiments over a 1s period revealed these steps 
are preceded by flavin reduction (k= 149 +20s~'). d, UV-visible spectral 


AB, BB, GB, HB, and I3 PAS 3-sheet as well as the core Da, Ea and Fa 
elements. Similar to Aer2, HODM includes an extended HB-I8 loop 
and a shifted Fa helix not seen in other haem-binding PAS domains. 
The HODM En] His194 acts as the proximal haem ligand, comparable 
to Aer2 En His234. However, HODM lacks the core PAS-domain Ca 
helix that encompasses the Aer2 haem pocket, and features extended 
B-strands as well as a longer G3-HB loop. The N- and C-terminal 
extensions to the central HODM PAS domain cover the PAS 3-sheet 
on the side opposite to the haem binding pocket. 

Although the stoichiometry of the overall reaction catalysed by 
HODM bears some similarity to the cytochrome P450 enzymes!”, 
HODM does not contain a Cys haem-thiolate ligand that assists 
with oxygen-oxygen heterolytic bond cleavage to form compound I 
(the so-called ‘push’ effect!*). By contrast, many haem peroxidases 
also contain a His ligand and have been shown to contain a highly 
polarized distal active site that achieves the same feat (referred to as 
the ‘pull’ mechanism!*). We determined the structures of HODMa 
in complex with DMA (to 1.75 A resolution, Fig. 3d) and with both 
DMA and NO (a mimic for the DMA-O, complex; to 1.9 A resolution, 
Fig. 3e). DMA binding displaces two water molecules and occurs in 
close proximity of the distal haem ligand (Fig. 3d). Direct electrostatic 
interactions between the DMA substrate and the enzyme are made 
with the conserved HB Glu266, in addition to van der Waals inter- 
actions between the substrate methyl groups, Trp180 and the haem 
porphyrin. NO binding to the haem iron occurs at a distance of 1.95 A 
(Fig. 3e). The oxygen atom of NO (mimicking the distal 6 oxygen of 
O>) is within hydrogen bonding distance of the conserved G8 Arg224 
and H6 Gln268, as well as the DMA amine group (Fig. 3e). This is 
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Wavelength (nm) 


evolution (over 100s) from the anaerobic mixing of ferrous-oxy HODM 
(441M) with 200 1M NADPH (in the absence of DMA). Fitted to an 
a—b—c—d model with k; = 0.09 + 0.02 s“! (initial 8 subunit reduction; 
black line to grey line), ky = 1.0 0.2s! for the ferrous-oxy-DMA to 
ferric (dotted line) and k; = 0.03 +0.01s~ for the ferric to ferrous (dashed 
line) transition. e, An overlay of the HODM spectrum under steady-state 
conditions (enzyme in the presence of excess NADPH, O, and DMA; black 
line) with the ferric DMA-bound resting state (grey line). f, UV-visible 
spectral evolution (over 1.6s) from the anaerobic mixing of ferrous-oxy 
HODM « subunit (7.4|1M) with NADH in the presence of 75 1M 
phenazine methosulfate and 2mM DMA. Fitted to an ab-—c model with 
k, =0.24+0.07s~! (ferrous-oxy-DMA to ferric) and k, = 0.09 + 0.02 s“! 
(ferric-DMA to ferrous-DMA). 


consistent with the approximately 90-fold higher DMA affinity of the 
ferrous-oxy complex compared with ferrous HODM. Substitution of 
Trp180, Arg224 or Glu266 all lead to enzyme inactivation (data not 
shown). Although oxygen binding is detected for W180A, E266Q and 
R224<A variants, no DMA binding is detected for E266Q, while direct 
haem ligation by DMA occurs in the R224A variant (Extended Data 
Fig. 4c-e). 

The structural arrangement of the HODM haem subunit active site 
shares the polarized nature of the haem peroxidases”: the conserved 
Arg224, the amine substrate (presumably charged when bound to 
conserved HG Glu266) and the conserved Gln268 each contribute 
a (partial) positive charge to the distal oxygen. On the other hand, 
the HODM proximal ligand environment is similar to that observed 
in globins, and a peroxidase-like interaction with an Asp to enhance 
electron-donating character is not observed. The latter is consistent 
with the stable O2 complex and the positive redox potential. A key 
question is whether HODM achieves oxidative N-demethylation 
through species with higher Fe oxidation states, such as the highly 
reactive compound I (as typified by cytochrome P450 enzymes!”) or 
compound II (as occurs in peroxidases”’). 

We propose that, following DMA binding, the 8 subunit reduces the 
ferric haem iron to ferrous using NADPH, leading to dioxygen binding 
and formation of the ferrous-oxy/ferric-superoxy form (Fig. 4). Transfer 
of the second electron and formation of the ferric hydroperoxide 
complex (compound 0) is probably coupled to proton transfer from 
the DMA substrate. This particular step explains the strict requirement 
of this enzyme for amine substrates. Following deprotonation of the 
substrate concomitant with compound 0 formation, Arg224 is likely 
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Figure 3 | Crystal structure of the P. mendocina HODM a subunit. 

a, Cartoon representation of HODMa, with the N-terminal and the 
C-terminal extensions coloured in green and pink respectively. The central 
PAS domain is coloured according to secondary structure. The haem 
cofactor is shown in atom coloured sticks. b, A side-by-side comparison 

of the HODM PAS domain (left panel) with the PAS oxygen sensor Aer2 
(right panel). Colour coding as in a. c, A view of the HODM a subunit 
active site in the ferric state, with key residues shown in atom coloured 


to act as the acid required to catalyse O-O scission. Oxidation of the 
substrate and O-O scission can, in principle, occur in a variety of 
ways (Extended Data Fig. 5), possibly including Fe(V) and/or Fe(IV) 
oxidation states. Both mono-oxygenase (the direct incorporation of 
oxygen, leading to hydroxymethylamine) as well as amine oxidase 
activity (leading to an imine product) appear possible, but are not 
readily distinguishable as both products rapidly convert to formal- 
dehyde and mono-methylamine. 

While ample spectroscopic evidence has been presented for such 
reaction mechanisms in other haem-dependent enzymes!??!, we 
have found no evidence of either compound I or II during turnover 
or rapid mixing experiments with HODM. Similarly, X-ray-induced 
photoreduction of ferrous-O,-DMA crystals at 100 K led to forma- 
tion of the ferrous species with no indication of higher Fe oxidation 
states (Extended Data Fig. 6). Our inability to observe transient, higher 
haem Fe oxidation states does not exclude the possibility that HODM 
operates via such species. However, the direct alignment of the sub- 
strate N-methyl group with bound oxygen also allows for direct hydride 
transfer from the methyl group to the compound 0 proximal oxygen. 
Coupled with heterolytic O-O scission and protonation of the distal 
oxygen by Arg 224, this mechanism does not require formation of a 
higher Fe oxidation state (Fig. 4). A direct hydride transfer mechanism 
has been proposed for the nitric oxide reductase P450nor”’, while the 
direct addition of Cl to Fe(V)=O in chloroperoxidases has some 
similarity”. 

Alternatively, proton transfer from Arg224 could be coupled to 
single-electron transfer from the substrate, creating compound II anda 
substrate amine radical. In contrast to the hydride transfer mechanism, 
the radical species can result in either the N-oxide or iminium product. 
It is interesting to note that, unlike DMA, methylamine (MA) is 
oxidized without the formation of significant levels of formaldehyde 
(Extended Data Fig. 7). This suggests that HODM is unable to form 
the MA-derived imine product. As the MA-derived reaction inter- 
mediate(s) are similar in energy to their DMA-derived counterparts 
(Extended Data Fig. 8), this suggests incorrect positioning of the 
single methyl group of MA, leading to formation of the N-oxide product. 
Distinct active and inactive methyl binding pockets have been 
implicated in the sequential N-demethylation process catalysed by 
cytochrome P450 2B4”*. 
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sticks. Sequence alignment reveals H194, R224 and F281 are strictly 
conserved, while other positions contain conservative mutations: W or F 
occur at 180, E or D at 266 and Q or Eat 268. The 2F, — F, electron density 
map is contoured at 1o (blue mesh). d, HODMa active site in complex 
with the DMA substrate. The F, — F, omit density corresponding to DMA 
is contoured at 40 (green mesh). e, HODMca active site in complex with 
the DMA substrate and NO. The F, — F, omit density corresponding to 
DMA and NO is contoured at 4c (green mesh). 


Our data confirm that the PAS fold can accommodate enzymatic 
activity, which suggests that other PAS enzymes are likely to exist. 
Furthermore, it points to the fact that existing PAS domains can be 
evolved to catalyse reactions: engineered variants of a PAS oxygen 
sensor have been shown to display haem oxygenase activity”. 
N-dealkylation is a widespread reaction, catalysed by cytochrome P450 
enzymes'”, FAD/quinone amine oxidases**”’ and «-ketoglutarate- 
dependent Fe oxygenases”*. HODM represents a distinct catalyst, which 
we propose acts as an oxidase. Unlike the more widespread flavin amine 
oxidases, HODM oxidizes both NADPH and the amine substrate to 
convert oxygen to water (as opposed to peroxide). DMA oxidation 
sensitive to CO has been shown to occur in yeasts’, and this new 
type of amine oxidase also appears in eukaryotes, with HODMa 
homologous genes present in a range of genomes (Extended Data 
Fig. 9). 
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Figure 4 | HODM is a haem-dependent amine oxidase. Following 
formation of compound 0, DMA oxidation can occur through a radical 
mechanism (red arrows) involving transient formation of the Fe(IV) 
oxidation state (Compound II) or directly through H-transfer (blue 
arrows), bypassing the need for higher Fe oxidation states. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Materials. All materials were obtained from Sigma-Aldrich unless otherwise stated. 
Cloning, mutagenesis and heterologous expression in E. coli. For expression 
of an active HODM module, the gene encoding the haem « subunit (Pmen_3455 
in the Pseudomonas mendocina ymp genome) and 6 subunits (Pmen_3453) were 
synthesized and co-expressed from the pCOLADuet-1 plasmid (Novagen). The 
+ (Pmen_3456; glycine cleavage T protein - aminomethyl transferase) and 8 subunits 
(Pmen_3454; ferredoxin) were synthesized and expressed from the pETDuet-1 
plasmid for a two-plasmid expression system for all four subunits (a, 8, y and 8). 
The « subunit bears a C-terminal His, tag, the 6 subunit bears an N-terminal His, 
tag, whereas the ¥ and 3 subunits are untagged. For haem domain isolation, the 
q subunit was expressed from the pET28a plasmid with a C-terminal His, tag 
(Novagen). Site-directed mutagenesis of the « subunit within the pPCOLADUET-1 
vector was performed to include a Trp180Ala mutation (538-540 bp, TGG to 
GCG), a Arg224Ala mutation (670-672 bp, CGT to GCT) and a Glu266Gln muta- 
tion (796-678 bp, GAA to CAA). HODM variants were expressed in E. coli BL21 
(DE3) cells (Novagen). E. coli transformants were grown in 112YT supplemented 
with 25,.g ml! kanamycin and 100,.g ml! ampicillin) and incubated at 37°C 
180r.p.m. until the culture reached an ODgo9 of 0.6. Protein expression was then 
induced with addition of 0.5 mM IPTG. Cultures were grown overnight at 30°C, 
1801.p.m. and then harvested by centrifugation (7,000g for 10 min). Cell pellets 
were resuspended in buffer A (100 mM Tris, 200 mM KCl pH 7.5) supplemented 
with DNase, lysozyme (Sigma-Aldrich) and a complete EDTA-free protease 
inhibitor cocktail tablet (Roche). Cells were lysed using a French press at 1,500 p.s.i. 
and the lysate clarified by centrifugation at 125,000g for 90 min. The supernatant 
was applied to a Ni-NTA agarose column (Qiagen). The column was washed with 
three column volumes of buffer A supplemented with 10 mM imidazole and the 
protein was eluted in 1 ml fractions with buffer A supplemented with 300 mM 
imidazole. Imidazole was removed using a 10-DG desalting column (Bio-Rad) 
equilibrated with buffer A. The HODM haem domain was further purified using 
Q-Sepharose resin at pH 8 (GE Healthcare) with a 100-500 mM elution gradient 
(Rz~ 1.2). For crystallization studies, the purified protein was subjected to size- 
exclusion chromatography using a Superdex 200 column (GE Healthcare) in 
50mM KPi, 250mM KCl (pH 7.5). Protein was concentrated as required using a 
Vivaspin centrifugal device (Sartorius). 

Haemoprotein concentration estimations. The pyridine haemochromogen 
method was used to quantify HODM haem and to determine an extinction 
coefficient at the Soret maximum (€423=94mM~! cm7!) according to the method 
of Berry and Trumpower*!. 

UV-visible absorbance studies. UV-visible absorbance analysis was carried out 
using a Cary UV-50 UV-visible scanning spectrophotometer (Varian) using a 
1cm path length quartz cuvette, recording spectra between 250 and 700 nm, and 
typically with HODM at 81M in buffer A. Potassium ferricyanide was used to 
chemically oxidize HODM and O, binding experiments were performed in an 
anaerobic glove box (Belle Technology, Weymouth, UK) under a nitrogen atmos- 
phere with O, levels maintained at less than 2 p.p.m. All solutions were deoxy- 
genated by sparging with nitrogen gas. To obtain the ferrous dioxygen complex, 
HODM (holoenzyme or the haem subunit in isolation) was reduced by sodium 
dithionite, followed by slow bubbling of air into the reaction mixture. The rate of 
decay of the ferrous—O, complex was determined by continual scanning every 
~2 min over ~6 hours. Half-life values were derived by plotting the induced optical 
change against time and fitting using an exponential decay equation using Origin 
software (OriginLab, Northampton, MA). Rate of decay values were derived using 
a first order rate equation. 

Ligand binding titrations. Optical titrations to determine Ky values for DMA were 
performed at 25°C using HODM at ~8|.M in buffer A. DMA stocks were made 
up in the same buffer and added volumes did not exceed 0.1% of the total volume. 
Kg values were determined by plotting the induced optical change against ligand 
concentration and by fitting using a hyperbolic equation within Origin software, or 
(for tight-binding ligands) using the Morrison (quadratic) equation*” (OriginLab, 
Northampton, MA). 

NADPH spectrophotometric activity assays. At each concentration of DMA 
(0-600 1M), 2001.M NADPH was added to initiate the reaction and the rate of 
change in absorbance at 340 nm monitored over 1 min at room temperature to 
follow substrate-dependent NADPH oxidation. Reactions were performed in 
triplicate to produce a mean rate calculated as moles of NADPH oxidized per mol 
HODM per second. These data were plotted against the relevant DMA concen- 
tration and fitted using the Michaelis-Menten hyperbolic function within Origin 
software. Similar experiments were conducted using perdeuterated DMA or in 
D,0, revealing no significant differences in NADPH oxidation rates. 


In vitro formaldehyde detection using Purpald. For the detection of substrate- 
dependent formaldehyde production, the Zurek and Karst method*? was used, 
however with a Purpald solution of 1 mg ml! in 1M NaOH. HODM activity 
reactions were set up as described above with an enzyme concentration of 200nM 
with 250,1.M DMA and allowed to react for 5 min at room temperature. Microtitre 
plate assays were performed using a 96-well plate reader (BioTek Synergy HT, 
Biotek). 100 il of Purpald solution was mixed with 100 il of sample and absorbance 
measurements were carried out after a reaction time of 60 min at a wavelength of 
550nm. A formaldehyde standard curve was calculated at known concentrations 
of formaldehyde. Reactions without HODM or NADPH result in the loss of for- 
maldehyde formation. The assay underestimated this loss owing to background 
absorbance from the standard curve and so data were normalized accordingly. 
In vivo formaldehyde detection. Enzymatically catalysed formaldehyde produc- 
tion was assayed in vivo as reported previously". In brief, the respective assay makes 
use of a UV-light-sensitive GFP (excitationmax = 395 nm, emissionmax = 509 nm) 
fused to FrmR as sensor and is hosted in E. coli W3GM as a formaldehyde-sensitive 
host. Following transformation with expression constructs of formaldehyde- 
producing enzymes, this reporter strain allows quantitative measurement of cyto- 
solic formaldehyde levels in vivo. The corresponding assays were performed using 
a 96-well microtitre plate reader (BioTek Synergy HT, Biotek, USA) equipped with 
a 360p;/528pm nm filter set with readings taken in the early stationary phase (22 to 
24h after inoculation). The data were corrected for background fluorescence of 
uninduced controls and adjusted for cell density and enzyme activity as necessary. 
All assays were done as sixfold biological replicates at least, and the results were 
found to be statistically significant within a confidence interval of 3%. Low levels 
of formaldehyde were measured colourimetrically with acetylacetone/ammonium 
acetate as described*4. Aliquots were taken (0-1 ml) and made up to a final volume 
of 1 ml with water and combined with 0.03 ml 7.5 M HCl and 0.25 ml ofa 4x Nash 
reagent solution, prior to incubation at 38 °C for 90 min. Samples were centrifuged 
for 5 min at 14,000 r.p.m. in a microfuge and the A4;4 determined in a 1-cm cell. 
Electron paramagnetic resonance (EPR) analysis. Continuous wave EPR spectra 
for HODM were recorded at X-band (~9.4 GHz) using a Bruker ELEXSYS E500/ 
E580 EPR spectrometer (Bruker GmbH, Rheinstetten, Germany). Temperature was 
maintained using an Oxford Instruments ESR900 helium flow cryostat coupled to 
an ITC 503 controller from the same manufacturer. The experimental conditions 
employed to obtain individual spectra are given in the figure captions. EPR sample 
tubes were 4mm Suprasil quartz supplied by Wilmad (Vineland, NJ). 

Redox potentiometry. To determine the midpoint potential for the HODM haem 
domain Fe!"/Fe!! couple, a spectroelectrochemical titration was performed in an 
anaerobic glove box (Belle Technology, Weymouth, UK) under a nitrogen gas 
atmosphere. All solutions were degassed under vacuum with nitrogen prior to use 
in the glove box. Oxygen levels were maintained at less than 5 p.p.m. The concen- 
trated HODM haem domain protein sample was passed through a Sephadex G25 
column (1 x 20cm) (10DG column, Bio-Rad, Hemel Hempstead, UK) immediately 
on admission to the glove box to remove all traces of oxygen. This column was 
pre-equilibrated and proteins were buffer-exchanged into anaerobic redox 
buffer (100 mM Tris, 200 mM KCI, pH 7.0). The HODM haem domain solution 
(~10-15,1M enzyme in 5 ml buffer), was titrated electrochemically according 
to the method of Dutton*, using sodium dithionite as reductant and potassium 
ferricyanide as oxidant. Mediators were added to expedite electronic equilibration 
in the system (2\1M phenazine methosulfate [PMS], 51M 2-hydroxy-1,4- 
naphthoquinone [HNQ], 51M methyl viologen [MV], and 1 1M benzyl viologen 
[BV]) and to mediate in the range between approximately +100mV and —480 mV 
versus the normal hydrogen electrode (NHE)**. 10-15 minutes were allowed to 
elapse between each addition of reductant/oxidant to ensure equilibration and 
stabilization of the electrode. Spectra (250-750 nm) were recorded using a Cary 
UV-50 Bio UV-Visible scanning spectrophotometer coupled to a fibre optic probe 
immersed in the HODM haem domain solution. The potential was measured using 
a SevenEasy S20-K meter (Mettler Toledo, Leicester, UK) coupled to a Calomel 
electrode (ThermoRussell, Cupar, UK) at 25°C. The calibration of the electrode 
was done by using the Fe!!/Fe!! EDTA couple as a standard (+108 mV). The 
electrode reading was corrected by +244 mV relative to the NHE. Absorbance 
data at wavelengths reporting on the transition of the haem Soret band between 
oxidized (Fe!!”) and reduced (Fe!) forms of the HODM haem domain were plotted 
versus applied potential, and the data were fitted using the Nernst function in 
Origin 8.5 (OriginLab, Northampton MA). 

Stopped-flow spectroscopy. Stopped-flow absorbance measurements were made 
using an Applied Photophysics SX18 MVR stopped-flow spectrophotometer in 
an anaerobic glove box (Belle Technology Weymouth, UK). The drive syringes 
were loaded with separate solutions of ferric or ferrous-oxy protein (generation 
of this complex with aerobic buffer is described above) and varying concentra- 
tions of reductant in buffer A in the presence and absence of DMA (260,1M). 
Multiple wavelength data were collected at room temperature using a photodiode 
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array (PDA) detector and XSCAN software. The resulting spectra were subjected 
to global analysis using SpecFit/32 software (Bio-Logic Science Instruments, 
Grenoble, France). Singular value decomposition of the dataset provides an 
assessment of the number of spectrally distinct components present in the reaction. 
The data were fitted using irreversible models. 

Crystallization, refinement and model building. The HODM haem subunit was 
crystallized at 3 mg ml~! in 50mM KPi, 100 mM KCl, pH 7.5. Initial crystallization 
conditions were identified using the JCSG+ matrix screen (Molecular dimensions). 
Crystals suitable for diffraction experiments were obtained by sitting drop vapour 
diffusion at 4°C in 400 nl drops containing equal volumes of protein and a solution 
containing 30% PEG 2K MME and 0.1 M potassium thiocyanate. The crystals 
were derivatized by soaking with the same solution supplemented with 1 mM 
mercury (II) acetate for 10 min. For the DMA-bound structures, native crystals 
were soaked with 1 mM ligand for 10 min. For ferrous structures, the crystals were 
soaked with sodium dithionite followed by 1mM NaNO; and 1mM DMA for 
ferrous-DMA-nitric oxide bound structures. The crystals were cryoprotected 
by the addition of 10% PEG 200 to the mother liquor and flash-cooled in liquid 
nitrogen. Data were collected on beamline 124 (wavelength 0.9173/1.0118 A) at the 
Diamond Light Source Facility and reduced and scaled with the X-ray Detector 
Software suite (XDS*’). The ligand-free HODM haem subunit crystal structure 
was determined by SIRAS using the MLPHARE program in the CCP4 suite**’. In 
brief, the SIRAS signal extending to 1.86 A from the comparison of a platinum 
derivative with a mercury derivative with a figure of merit of 0.85, phasing power 
of 1.04centric/ 1] gacentric and Reutlis of 0.7°eMttic/g gacentric and anomalous was used for 
initial map calculation following density modification using the DM program in 
the CCP4 suite. A non-isomorphic native dataset with higher resolution (1.65 A) 
was used for refinement. Automated model building with BUCCANEER in the 
CCP4 suite successfully constructed the entire model of the ligand-free HODM 
haem subunit structure. The model was completed by iterative cycles of manual 
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model building and real space refinement using the program Coot and crystallo- 
graphic refinement using Refmac5. The processing and final refinement statistics 
are presented in Extended Data Table 1. 

Single-crystal microspectrophotometry. Changes in visible absorbance spectra 
during X-ray exposure were monitored at the European Synchrotron Radiation 
Facility (ESRF) in Grenoble on beamline ID 14-1 using an on-line spectrophotometer 
(OCEAN OPTICS DH 2000 light source and HR 2000 detector). AHODM a 
subunit crystal was soaked with sodium dithionite and DMA followed by transfer 
to mother liquor supplemented with DMA in the absence of dithionite to allow 
for oxygen complex formation. Initial spectra prior to X-ray illumination revealed 
formation of a species consistent with the ferrous-O,-DMA complex. 

Data availability. Coordinates and structure factors have been deposited in the 
Protein Data Bank under accession numbers 5LTE, 5LTH and 5LTI. 
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Extended Data Figure 1 | See next page for caption. 
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Extended Data Figure 1 | Catabolic pathway featuring HODM 

and additional HODM characterization in vivo and in vitro. 

a, Trimethylamine degradation pathway with links to C1 metabolism in 
proteobacteria. 1, trimethylamine monooxygenase; 2, trimethylamine 
N-oxide demethylase; 3, haem-dependent oxidative demethylase 
(HODM), also known as secondary amine monooxygenase (SAMO); 

4, methylamine dehydrogenase (EC 1.4.99.3); 5, methylamine oxidase 

(EC 1.4.3.21); 6, N-methylglutamate synthase (EC 2.1.1.21); 7, N- 
methylglutamate dehydrogenase (EC 1.5.99.5); 8, \-glutamylmethylamide 
synthetase (EC 6.3.4.12); 9, \- glutamylmethylamide-dissimilating 
enzyme; 10, methylene-H4F dehydrogenase/cyclohydrolase (EC 1.5.1.15); 
11, formyl-H4F deformylase/ formyl-H4F synthetase (EC 3.5.1.10); 12, 
formate dehydrogenase (EC 1.2.1.2). Formaldehyde can be assimilated 

via the ribulose monophosphate (RuMP) pathway or, via methylene- 
tetrahydrofolate, through formation of serine. The inset shows an SDS- 
PAGE gel of purified recombinant HODM, revealing bands corresponding 
to all four subunits. Additional bands are visible that appear to correspond 
to multimeric forms or proteolytic fragments of HODM subunits as 
verified by mass spectrometry of tryptic digests. b, Formaldehyde leakage 
from HODM in vivo. Efficient detoxification of formaldehyde requires 

the + subunit as well as tetrahydrofolic acid (THF). Cells containing wild- 
type and mutant HODM enzyme (Ay, deletion of subunit gamma) from 
Methylobacillus flagellatus (strain KT) were grown in the presence of DMA 
(grey bars) or DMA and glycine (shaded bars), respectively. Fluorescence 
readings were subsequently corrected for background activity and scaled 
with the mutant enzyme set as 100%. The data show that glycine, as well as 


LETTER 


the 4 subunit, have strong effects on enzymatic formaldehyde production. 
While the first increases intracellular levels of THE, the latter is required 
for THF binding. c, The HODM ferrous-oxy complex is long-lived at room 
temperature. The ferrous-oxy decay to the ferric state was monitored 

at 409 nm and plotted as a function of time. The decay curve observed 
was fitted using the exponential decay equation to derive a half-life of 

50 + 10 min with a decay rate of k=0.014+ 0.001 min~!. d, Determination 
of the redox potential of the HODM haem subunit. The main panel shows 
UV-visible spectra for the HODM haem domain during a redox titration. 
The oxidized enzyme (thick black line, spectrum recorded at —76 mV 
versus normal hydrogen electrode) has its Soret maximum at 409 nm. 

The fully reduced enzyme (thick line, spectrum recorded at —317 mV 
versus NHE) has its Soret maximum at 422 nm. Intermediate spectra are 
indicated in thin lines, and there is an isosbestic point at approximately 
415nm. The HODM haem domain also displays increased absorbance 

in the Q-band region (~520-620 nm) in the reduced (ferrous) state. The 
inset shows a plot of absorbance change (AA422 minus AA4o9) versus 
applied potential (versus NHE) fitted using the Nernst equation. This 
provides a midpoint reduction potential value for the HDOM haem 
domain Fe!!/Fe!! couple of 128 +4mV versus NHE. e, f, DMA binding to 
ferrous HODM (e) and DMA binding to ferrous-oxy HODM (f). 
Absorbance changes (as shown in the inset) as a function of DMA 
concentration are fitted using a hyperbolic function/Morrison equation, 
leading to a Kg=1.7 £0.2 mM for ferrous HODM (ferric HODM has a 

Kg of 1.5+0.1mM, data not shown) and a Kg= 15 +3 1M for ferrous-oxy 
HODM (errors bars are s.e.m., 1 = 3). 
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Extended Data Figure 2 | X-band continuous-wave electron 
paramagnetic resonance spectra of anaerobically purified HODM. 

a, Anaerobically purified HODM exposed to air for two minutes. 

b, The HODM enzyme shown in a with the addition of excess DMA. 

c, Anaerobically purified HODM. d, The HODM enzyme shown in ¢ with 
the addition of excess DMA. Experimental parameters: microwave power, 
0.5 mW; modulation amplitude, 3 G; temperature, 30 K. e, HODM alone. 


3150 3200 3250 3300 3350 3400 3450 3500 3550 3600 
Field (G) 


f, HODM plus excess DMA. g, HODM plus excess diethylamine. 

h, HODM plus excess piperidine. Experimental parameters: microwave 
power, 0.5 mW; modulation amplitude, 3 G; temperature, 30 K. Only 

DMA (f) shows the loss of the '4N His superhyperfine coupling in the 
ferrohaem-—NO component of the spectrum and a significant change in the 
g values exhibited by the [2Fe-2S]'* component of the spectrum. 
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Extended Data Figure 3 | HODM cannot make use of peroxide. Spectral changes observed following mixing of 101M ferric HODM with 200 |tM H20o. 
The haem Soret peak (black line) decays upon H,O, addition (200|1M) (thin grey lines each after 30s) with further decay at 5 min incubation (thin black 
line) and 60 min (black dotted line). The inset shows a similar experiment conducted in the presence of 2mM DMA. 
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Extended Data Figure 4 | The HODM haem subunit has similar 
properties to the HODM holoenzyme. a, UV-visible absorption features 
of alpha HODM (8.5 1M). Ferric (thick solid line), ferrous (thin solid 

line) Fe?*-O, (dotted line) and Fe?*-CO (dashed line). The major (Soret) 
absorption band is centred at 409, 423, 418 and 421 nm, respectively. The 
a/ bands are magnified in the inset. b, Titration of ferric and ferrous-oxy 
HODM a subunit with DMA. Absorbance changes associated with the 
haem Soret peak are plotted as a function of DMA concentration. The data 
are fitted using a hyperbolic function or the Morrison equation. Top, ferric 
with Ky= 12.6 + 0.2 mM. Bottom, ferrous-oxy with Kq=41.0 £5.01.M 
(errors bars are s.e.m., 1 = 3). Insets show UV-vis spectra of the titration 
of DMA (0-50 mM) against ferric and ferrous-oxy HODM a subunit 
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(7.5/4.8 1M). The UV-vis spectrum of « in buffer A was recorded initially 
and then after each addition of DMA. The direction of the absorbance 
changes of the ferric and ferrous-oxy Soret peak on substrate addition 

is measured from 409/415 nm (thick line) to 416/414 nm (dotted line), 
respectively. c—e, Spectral properties of HODM haem subunit variants 
E266Q, W180A and R224A, respectively. Ferric (thick black line), ferric in 
presence of 1mM DMA (thick grey line), ferrous (thin black line), ferrous 
in presence of !mM DMA (thin grey line) and Fe**—O, (dotted line). 

The a/8 bands are magnified in the inset. While the E226Q variant lacks 
spectroscopic features associated with DMA binding, the R224A variant 
appears to bind DMA as a 6th ligand to the haem iron in the ferrous state. 
Oxygen binding can be detected for all variants. 
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Extended Data Figure 5 | Possible routes for DMA oxidation in HODM. 
Following formation of compound 0, various possibilities exist for O-O 
bond cleavage in principle. A direct hydride transfer from the substrate 

is shown in A, bypassing the need for higher oxidation states of the haem 
Fe. Route B depicts the homolytic bond cleavage with formation of 
compound II and an amine radical. Route C depicts a P450-like 


mechanism, with formation of the highly oxidizing compound I. Both the 
B and C routes can give rise to various products. The N-oxide product is 
not observed with DMA as a substrate, which results in stoichiometric 
production of formaldehyde in vitro. This can be derived from either the 
iminium or hydroxylamine products. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


0.06 
0.04 


0.02 


Absorbance 


0.00 


-0.02 


500 600 700 
Wavelength (nm) 


Extended Data Figure 6 | X-ray-induced photoreduction of the HODM 
haem subunit in complex with DMA and O). The initial spectrum 
recorded before X-ray illumination (ESRF beamline ID14-1 (wavelength 
0.934 A)) was subtracted from the post illumination spectrum. A shift in 
the a/6 bands is recorded with a decrease at 575 nm and an increase at 
556nm. This corresponds to formation of the ferrous species (merged 
a/B band at 556 nm). 
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Extended Data Figure 7 | MA-dependent NADPH oxidation by Ky=0.55 £0.04 mM. Inset shows Purpald colourimetric assay for 
HODM in absence of formaldehyde production. The kinetic curve detection of substrate-dependent formaldehyde production. Data are 
(mean values of initial rate plotted as a function of MA concentration) represented as a percentage of expected formaldehyde produced for 1:1 
was fitted using the Michaelis-Menten function k= 7.0 + 0.1s~! and conversion of DMA/MaA to formaldehyde. Error bars represent s.d. 
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(0.00000) (0.00000) (0.00000) (0.00000) 
(3) radical cation -134.33122 -95.15169 -134.76707 -95.45484 
(0.34739) (0.36998) (0.34291) (0.36753) 
(4) protonated -133.83615 -94.66587 -134.25113 -94.94862 
imine (0.84246) (0.85580) (0.85884) (0.87375) 
(5) deprotonated -133.49597 -94.33499 -133.90958 -94.61696 
imine (1.18263) (1.18669) (1.20040) (1.20542) 


Extended Data Figure 8 | Gas phase density functional theory and 

ab initio comparison of the energies of DMA- and MA-based reaction 
intermediates. Quoted energies in the table are the sum of electronic and 
thermal enthalpies for structures in the gas phase at 298.15 K with units 
of hartrees. Values in parentheses are the difference relative to the neutral 
amine. The relative energies of the DMA species are lower than those of 


the MA species by 10-65 kJ mol”! (0.04-0.025 hartrees). The pKa values 
of DMA (R= CH) and MA (R= H) (1) are very similar (10.64 and 
10.62 respectively). Calculations were performed at both the 
(U)B3LYP/6-311++G(d,p) and (U)MP2/6-311++G(d,p) level 

using the ‘Freq’ keyword in Gaussian 09 rev. B.01. 
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Extended Data Figure 9 | Multiple sequence alignment of HODM 
including Aer2 PAS domain secondary structure information. 
Alignment of HODM proteins from the DUF3445 family from the 
bacteria: Pseudomonas mendocina (Pmen), Rhizobium etli (Retl), 
Candidatus Puniceispirillum marinum (Cpun) and Gordonia bronchialis 
(Gbro), and from the fungi Candida dubliniensis (Cdub) and Nectria 


haematococca (Nhae). The catalytic Arg, Glu and Gln residues and the 
substrate-discriminating Trp are marked with black dots. The conserved 
haem-binding His in Pmen and Aer2 are marked with a grey and a black 
asterisk, respectively. Secondary structural elements are highlighted, 
PAS-domain-specific «-helices in blue and 8-sheets in pink, and HODM- 
specific a-helices in pale grey and 3-sheets in dark grey. 
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Extended Data Table 1 | Data collection and refinement statistics for P mendocina HODM a subunit 


HODMalpha Ferric Ferric-DMA Ferrous-DMA-NO 
PDB code 5LTE 5LTH 5LTI 
Data collection 
Space group P 43212 P 43 21 2 P 43 212 
Cell dimensions 
a,c (A) 79.91, 144.71 80.38, 144.79 80.21, 144.61 
Resolution (A) 26.82-1.65 56.84-1.76 28.36-1.90 
(1.70-1.65) (1.81-1.76) (2.01-1.90) 
Boi 0.034 0.027 0.057 
(0.223) (0.211) (0.306) 
Mol 15.4 18.2 13.7 
(3.8) (3.5) (4.5) 
Completeness (%) 99.93 99.91 100 
(100) (100) (100) 
Redundancy 12.1 12.9 12.0 
(12.2) (12.2) (12.0) 
Refinement 
Resolution (A) 26.82-1.65 56.84-1.76 28.36-1.90 
(1.70-1.65) (1.81-1.76) (2.01-1.90) 
No. reflections 53715 45331 36266 
(3876) (3261) (2623) 
Bwow Ries 0.163/0.194 0.168/0.187 0.173/0.201 
(0.198/0.248) (0.241/0.262) (0.21/0.28) 
No. atoms 3134 3066 3004 
B-factors (A2) 15.86 20.76 19.82 
R.m.s deviations 
Bond lengths (A) 0.024 0.028 0.025 
Bond angles (°) 2.23 2.42 2.11 


Values in parentheses correspond to the outer resolution shell. 
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In this Letter, we have identified an important error affecting Fig. 4 
and Extended Data Fig. 6, as well as the values of some parameters 
derived from our model fits. We stress that this error in no way affects 
the discussion or the conclusions. In building the bolometric light 
curve of the superluminous supernova PTF 12dam, our code assumed 
that photometry from the Swift satellite was calibrated in the Vega 
magnitude system. However, our photometry was actually calibrated 
to the AB magnitude system (and published in the AB system in the 
original paper). This led to an underestimate of approximately 50% in 
the measured peak luminosity of PTF 12dam. 

Here we present updated figures and model fits with the correct 
bolometric luminosity. To construct the bolometric light curve, we 
transformed the Swift data into Vega magnitudes, and then converted 
all photometry to fluxes. At epochs with the full range of UVW2 to 
K band, we simply integrated over the observed spectral energy 
distribution. At epochs with missing filters, we accounted for the 
unobserved flux by fitting blackbodies to the available data. We also 
compared our blackbody extrapolations against polynomial fits to the 
ultraviolet and near-infrared light curves, finding consistent results. 

This should be more reliable than our previous extrapolation method, 
which assumed linear colour evolution over 40 days. If anything, the 
improved bolometric light curve strengthens our main conclusion— 
that PTF 12dam was not a pair-instability supernova—because the 
brighter light curve peak results in an even steeper rise to maximum. 
It is important to note that the large discrepancy compared to pair- 
instability models does not rely solely on bolometric comparisons: the 
difference was clearly apparent in the r-band light curves in our original 
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Fig. 1. Therefore this is a robust result independent of any time-varying 
bolometric correction. 

Our secondary conclusion—that spin-down of a nascent magnetar 
can satisfactorily explain the observed properties—also remains 
intact. The parameters of our magnetar-powered fit to the corrected 
bolometric light curve shown in Fig. 1 of this Corrigendum (the 
corrected original Fig. 4) remain within a sensible range and are as 
follows: magnetic field B=5 x 10% G; spin period P=2.3 ms; and 
ejecta mass M.; = 7Mo (where Mg is the solar mass) for an opacity 
K=0.1cm gt and explosion energy E= 10°! erg. Our suggestion 
that a relatively lower spin period and larger ejected mass can explain 
the existence of these long-duration superluminous supernovae is 
unchanged. 

In Extended Data Fig. 6, we showed that decay of radioactive 
nickel-56 could not explain the observed light curve. This remains true 
for the corrected light curve, which is shown in the Supplementary 
Information to the Corrigendum. The unrealistic parameters required 
to model the data with nickel as the power source are listed below the 
figure. 

We thank P. Vreeswijk for initially pointing out a discrepancy 
between our light curve and his own results. M.N. identified the source 
of the discrepancy. The original Letter has not been corrected. 


Supplementary Information is available in the online version of the 
Corrigendum. 


Correspondence and requests for materials should be addressed to M.N. 
(matt.nicholl@cfa.harvard.edu). 
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Figure 1 | This is the corrected Fig. 4 of the original Letter. PISN, 
pair-instability supernova. Error bars are 1¢, as in the original Letter. 
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Former climate scientist Jess Spear (third from left), now an organizer for the activist group Socialist Alternative, campaigns on the streets of Seattle in 2014. 


A call to serve 


The political arena need not be off-limits to scientists. 


BY CHRIS WOOLSTON 


s if full-time research werent time- 
Acie and challenging enough, 
nanophysicist Michael Stopa embraced 
a second occupation while at the bench: 
politics. He served as a delegate for US presi- 
dent-elect Donald Trump at this year’s Republi- 
can National Convention. Before that, while he 
was a senior scientist at Harvard University in 
Cambridge, Massachusetts, he blew his cover as 
a semi-secret conservative by running unsuc- 
cessfully as a Republican for the US Congress 
in 2010 and again in 2013. “My face was on the 
front page of the Harvard Crimson,’ he says 
of the university's student newspaper. “At that 
point, I was exposed’ 
Stopa, who now works at a technology 


startup near Boston, Massachusetts, says that 
his outspoken politics have cost him at least 
one close professional collaboration — and 
maybe more — but that hasn't quietened him. 
He still talks politics on the Harvard Lunch 
Club weekly podcast. In each session he takes 
part in, he discusses his conservative views, 
including his belief that illegal immigration 
threatens the United States. 

The acrimonious US presidential election 
is over, but politics are forever, and Stopa 
isn't the only scientist joining the fray. Many 
researchers take public political stands on 
Twitter and elsewhere, and some are engaging 
with political parties or running for office (see 
Join the party’). Politically active scientists can 
struggle to find the time and energy to bridge 
both worlds, and there's always the risk that 


an unpopular stand could cause friction. But 
there are also benefits: politics can provide 
another avenue for networking and outreach. 
And, ideally, scientists will be able to give 
governments the kind of input needed to pro- 
duce informed policy. Political involvement 
can also create a sense of real-world accom- 
plishment that is sometimes hard to find in the 
lab. “Nothing’s more rewarding than combin- 
ing the two passions,’ says David Mazzocchi- 
Jones, a neuroscientist at Keele University, UK, 
and a member of the local Labour Party. 
Despite the opportunities, few scientists 
have reached high office in government. 
Frauke Petry, chairwoman of the right-wing 
Alternative for Germany Party, has a chemis- 
try PhD, as does chancellor Angela Merkel. Of 
535 members of the US House and Senate, > 
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JOIN THE PARTY 


Getting political 


Want to get involved in politics? Here’s how 
to take the plunge. 

@ Step forward. Scientists willing to 
volunteer their time and talent are always 

in demand. “If you go to any party office 
anywhere in the world and say that you 
really want to help out, they aren’t going 

to say no,” says David Mazzocchi-Jones, 

a neurologist at Keele University, UK, who 

is active in the UK Labour Party. “Or call 

up a local politician and offer to help. No 
politician will turn you away.” 

@ Find your issue. Jess Spear, a former 
climate scientist who is now an organizer for 
activist group Socialist Alternative in Seattle, 
Washington, recommends campaigning 

for a local issue even if it’s not specifically 


> just two congressmen — a physicist and an 
engineer — have PhDs in the hard sciences. 
The UK-based Campaign for Science and 
Engineering counts 90 Members of Parlia- 
ment who have at least some background or 
interest in the sciences, engineering or medi- 
cine, including Thérése Coffey, who has a PhD 
in chemistry. That’s down from 103 science- 
minded MPs in the previous parliament. 
“Scientists are very under-represented in poli- 
tics in the UK,” Mazzocchi-Jones says. “Twenty 
years ago, there were quite a few more.” 

Researchers who manage to break into the 
political world could have a huge impact on 
policy, says Jeff Schweitzer, a former marine 
biologist who worked as a science-policy ana- 
lyst for the US Clinton administration in the 
1990s. “The biggest thing that a scientist brings 
is a method of thinking,’ he says. “They have a 
vocabulary that non-scientists might not have.” 
Scientists in government can help to bridge the 
gap between policymakers and the researchers 
who study, in great detail, how the real world 
actually works, he adds. 

Mazzocchi-Jones, a Labour councillor for 
Newcastle-under-Lyme, believes that his sci- 
ence background has helped him to handle the 
issues that matter to his constituents. “When 
were deciding ona new recycling system, I can 
say, ‘Show me the numbers,” he says. 

Governments are increasingly facing critical 
issues such as climate change and fracking 
(hydraulic fracturing) that call for scientific 
wisdom, says David Dunbar, a bioinformati- 
cian at the University of Edinburgh, UK, who 
is active with the Scottish National Party. “The 
scientists you see in the UK Parliament seem to 
be thinking in an evidence-based way, and that’s 
a positive,’ he says. “The party line isn't always 
evidence-based. And neither is public opinion” 

Scientists who aren't themselves politically 
active can still do their bit to keep politicians 


science-related. If you’re passionate 

about an increase in the minimum wage 

or a new tax, then making phone calls or 
writing letters could be a rewarding political 
introduction. 

@ Get in the race. Win or lose, campaigning 
iS a proven way to spread your message. 
Just be ready for the verdict. “You have to 
have a thick skin,’ Mazzocchi-Jones says. 
“Scientists are used to rejection, but getting 
rejected by 40,000 people is different.” 

@ Join an activist group — but only if you’re 
ready to be a team member. “It’s important 
that educated people come to activism with 
humility,’ Spear says. “You are there to build 
with others, not teach them, and you are not 
an expert in the political arena.” C.W. 


informed, even if only through a quick e-mail 
or achat with a local representative. “We need 
to engage more with politicians,” Mazzocchi- 
Jones says. “It’s not going to get us anywhere 
unless we talk to them directly.” He says that 
his interest in politics was rekindled in 2014 
when he helped to organize the UK Physiologi- 
cal Society’s Engaging with Parliamentarians 
outreach programme, where he and other sci- 
entists paired up with politicians to exchange 
ideas. He says that both sides must find ways 
to identify common ground. “Scientists have to 
step forward and be recognized,’ he says, “and 

politicians have to listen”. 
Politics can be a sticky subject, however, espe- 
cially when someone is out of step with their 
colleagues. Stopa says 


“We need to he felt some tension 
engage more at Harvard, and not 
with politicians. just with the friend 
It’s not goin g to and collaborator who 
et us anywhere severed ties with him. 
8 4 When Stopa’s con- 

unless we talk to 5 
tract wasn't renewed, 


them directly.” 


he was eager to move 
on. “It’s hard to be sur- 
rounded by people with different ideologies” 

Stopa doesn't regret publicly announcing his 
conservatism, but he understands why some 
conservatives prefer to keep quiet. “There’s an 
ongoing debate about whether or not to come 
out of the closet,’ he says. “You have to make 
that decision for yourself. If you think it might 
negatively affect your career, you might be 
better off not saying these things.” 

Schweitzer sees political activism as a right. 
No one in science should be afraid to put their 
politics on display, he says. “If that’s an obsta- 
cle,’ he adds, “you're not at the right institution” 

Sometimes, political anonymity isn’t much 
of an option. “A lot of my colleagues and stu- 
dents live in my ward,’ says Mazzocchi-Jones. 
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“I know for a fact that they’ve received leaf- 
lets with my face on them.” And his dual roles 
occasionally collide in awkward ways. “Col- 
leagues will tell me, ‘My bin didn't get collected 
last week,” he adds, by way of example. 

So compelling is political work for some 
scientists that they turn it into a full-time pro- 
fession. Stacey Danckert, who has a PhD in 
cognitive neuroscience from the University 
of Waterloo, Canada, declined a prestigious 
two-year grant from the Alzheimer Society of 
Canada in 2013 because she found it tough to 
balance research, politics and family commit- 
ments. “I decided to follow my passion for the 
environment,’ she says. 

She left the lab and is now policy coordinator 
for the Green Party of Ontario and a twice- 
unsuccessful Green Party candidate for the 
Provincial Parliament of Ontario. In her view, 
it’s almost impossible for a scientist to run for 
political office while staying in the lab. “Tt’s 
important to get your name out, and you can't 
do that without spending a lot of time,’ she says. 
“The two pursuits require endless dedication.” 

Similarly, Jess Spear, a former climate sci- 
entist who worked at the Burke Museum of 
Natural History and Culture in Seattle, Wash- 
ington, left research to join Socialist Alterna- 
tive, a socialist activist group, in 2011. After 
running unsuccessfully for the Washington 
state House of Representatives in 2014, she is 
now a full-time organizer for the group. “The 
more I got involved in climate science,” she 
says, “the more I became aware that we dont 
just need more data. We need political will” 

Schweitzer believes that scientists who can 
handle university politics have the mettle to 
excel at local, regional and national politics. 
“The skills are very transferable,” he says. “You 
have to show that you can get along with peo- 
ple, and you have to build networks” 

Perhaps most importantly, scientists tend 
to have a track record of working with large 
bureaucracies. “You need to be able to manipu- 
late the system to your will to get things done,” 
he says. “If you tend to get frustrated and just 
throw up your hands, politics probably isn’t 
for you.” 

Before jumping into politics, Schweitzer 
briefly ran his own lab at the University of 
California, Irvine, an experience that he says 
was invaluable in his second career. “In order 
to have credibility in Washington DC, you have 
to have had at least a short career as a lab scien- 
tist,” he says. “I wasn’t in the lab very long, but 
in their view I was a real scientist.” 

Mazzocchi-Jones manages, for the most 
part, to keep his work separate from his ideol- 
ogy. He says that he has a student who is an 
outspoken supporter of the UK Independence 
Party, a right-wing, anti-immigration party. “I 
find his politics abhorrent,’ Mazzocchi-Jones 
says. “But in the end, science unites us.” = 


Chris Woolston is a freelance writer in 
Billings, Montana. 
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HUNGRY GHOSTS 


BY CASSANDRA KHAW 


r | hey keep the oldest ghosts in little 
diskettes, squares of ageing steel 
and red plastic. Except not really, 

of course. Human consciousness is 
notoriously hard to optimize. You 
could trim the fat in theory, Xiao Lin 
thinks, as she dabs carmine on the slant 
of her cheekbones, contouring them to 
knives. Delete swathes of summertime 
screw-ups, summarize romances, con- 
dense college to empirical scores. But 
then youd lose the person in the pro- 
cess. Youd lose the soul. 

Better to empty the diskettes and 
cram their corpses with solid-state 
drives, keep the joke and the personal- 
ity. Otherwise, what's the point? Might 
as well skip the getai and head back up 
to the stars. 

“But it’s creepy, don't you think?” Her 
co-host this year is indistinguishable 
from the last: petulant, slim, haunt- 
ingly young, with pencil-stroke arms 
and dense, dark hair. The girl pins gold 
hoops to her ears and makes a moue at 
her reflection, earrings glittering as she 
tilts her chin one way and another, the 
light catching on silver-moted lashes. 
“Performing for dead people like that. We 
have the technology. Why don’t we just 
schedule something for them in VR?” 

Xiao Lin sighs. It’s always the same. Natu- 
ralize in space and you anchor in the stars, 
forget what it means to be Chinese, immi- 
grant-born and aching, your spine bent with 
ghosts, half-sick with longing for a place 
your children can call home. 

“Because they can tell? she chides, not 
quite lying. Realistic conjecture, Xiao Lin 
reassures herself, smearing malachite over 
her eyelids. Still, who knows if the Als can 
distinguish between fact and fabrication, can 
differentiate between polynomial renders 
and the jewelled sea, the saltwater breeze? 

“Can they, though?” 

Another layer of mascara, another coat- 
ing of liner. Xiao Lin spirals her lipstick back 
into its casing. A resolute click. 

Yes, she thinks. Absolutely. 

But what she says is: “Do you want their 
families to ask for their money back?” 

The invocation of 


> NATURE.COM money buries any 
Follow Futures: counter-argument. 

© @NatureFutures The two return to 
Ei go.nature.com/mtoolm = their preparations, 


Homing signal. 


silent. Outside, the orchestra tunes its 
instruments, performs chords in bright 
arpeggios. A thrum of laughter escapes as 
someone reels off a joke that Xiao Lin can 
almost hear. And for a heartbeat, she is back 


on Europa in a house teeming with relatives, 
the air rich with spices, a Chinese New Year 
comedy playing too loudly ona salvaged TV. 

“T dont think theyd ask for their money 
back.” The girl — Bao Bao or Mei Mei or 
something equally infantile — grumbles an 
interruption, timbre slightly defiant, as she 
secures her hair in double loops. “As long as 
their relatives are happy.’ 

Xiao Lin rises, red silk frothing along the 
slope of her hips. She picks a buyao — jade 
peonies dangling from a filigreed twig — 
from the dressing table and inserts it into the 
girl's hair. In the dim light behind the stage, 
they could be sisters, features simplified into 
stereotype, unified by skin. “Would you be 
happy, though?” 

“What?” 

“Lying to them like that.’ 

“To whom? The dead people?” 

“Yes.” 

An exasperated noise. “I mean, they’re 
dead —” 

“And at some point, they weren't. They 
were alive and afraid of the idea that theyd 
be forgotten, so afraid that they made copies 
of themselves. Just to live on. Just to say that 
a part of them will always be here.” Xiao Lin 
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works pins into her co-host’s qipao, moulds 
the dress to the girl’s silhouette, smaller and 
slighter than that of the original owner. 

“T still don’t understand. What’s your 
point? Even if we just installed them into a 
simulation, they wouldn't know” 

“But wed know.’ 

Clack. The sound of teeth coming 
together, like a coffin pulled shut. 

No more argument. Only a look 
reflected in the mirror, bottomless, 
older than the girl’s kohl-framed eyes, 
her lineless throat. Xiao Lin pats her 
on the shoulder before she drifts away, 
weaving between production assistants 
and extras, gofers and camera women, 
heels clicking a eulogy. 

At the curtains, she lounges against a 
pillar and tweaks apart the fabric. The 
audience is steeped in apparitions, 
holograms of varying sophistication, 
projected onto glass and ambient mois- 
ture. Interspaced in between are crim- 
son-clad humans, marrow and bone, 
middlemen facilitating long-distance 
conversations. 

Xiao Lin winds restless fingers 
around the loop of gold at her throat. 
Twenty thousand years and a fortune 
of galaxies at their fingertips and still, 
all they want to do is go home. So much 
money poured into the restoration of 
Earth, salving its scars, decontaminating 
the waters, restoring the skyline. So much 
effort for something they4d never see, a hal- 
cyon dream they clutched while the skies 
boiled with ash. 

And for what? A memory carried through 
generations, like a song passed down from 
mother to daughter. A chance to let their 
ghosts sit and laugh and clap at perfor- 
mances theyd never experience, only wit- 
ness by proxy. Was it worth it? 

A hand finds the crook of Xiao Lin’s elbow. 
She turns to see her co-host, grinning, iri- 
descent with cameras. You can go anywhere, 
Xiao Lin muses, if you know there’s some- 
where to go home to. And maybe, that was 
the point all along. 

“Ready?” 

“Ready.” Xiao Lin smiles and lifts the cur- 
tain, steps out to the roar of the surf and the 
applause of the ancestral dead. = 


Cassandra Khaw is the business developer 
for micropublisher Ysbryd Games. When 
not otherwise writing fiction, she punches 
things and occasionally, people. 
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